
y c u y i ^'t) 


J. M. Harker 
D. W. Brede 
R. E. Pattison 
G. R. Santana 
L. G. Taft 


A Quarter Century of Disk File Innovation 


This paper traces the development of disk file technology from the first disk drive to the present. A number of innovative 
advances are reviewed in the evolution of mechanical design, materials, and processes. These advances constitute the 
technological base that has permitted almost four orders of magnitude of improvement in areal density; they are 
discussed from four interrelated aspects: the magnetic head and its air bearing support ; the head positioning actuator; 
the disk substrate and its magnetic coating; and the read/write signal detection and clocking electronics. 


Introduction 

The first issue of the IBM Journal of Research and 
Development in 1957 featured two papers, one describing 
the IBM 305 RAM AC system [ 1 ] and the other describing 
the IBM 350 disk file [2]. As Stevens mentions in his 
overview paper “The Evolution of Magnetic Storage” 
[3], the 350 was the first production movable-head disk 
drive. The purpose of this paper is to describe the 
significant innovations that led to the introduction of that 
product and to the evolution of the current generation of 
disk file products. 

Inductive magnetic recording was selected as the base 
technology for disk files because of its advantages of 
nonvolatility, immediate readback without intermediate 
processing, unlimited reversibility, and the relative low 
cost and simplicity of the transducer and recording 
medium. 

Increasing linear bit density has depended upon the 
scaling down of the three geometric parameters of head- 
to-disk spacing, read/write gap length, and disk magnetic 
coating thickness. Progress in reducing these key geomet- 
ric parameters and the resulting linear bit density of 
selected products is shown in Table 1. Achieving these 
reduced parameters has been an iterative process and has 
required significant innovation. For example, scaling 


down the coating thickness from 1200 microinches to 
25 microinches has required extensive process develop- 
ment in the area of substrate preparation and magnetic 
film coating, formulation, application, buffing, and polish- 
ing. Reducing the spacing between the head and the disk 
from over 800 microinches to less than 13 microinches 
has required an in-depth understanding of air bearing 
technology through analysis, simulation and testing, de- 
velopment of low-mass sliders with very stiff air bearings, 
and development of very smooth and flat disk surfaces 
with few mechanical asperities. 

Early recording head cores were made of laminated 
mu-metal with the gap formed by a copper shim. Later, 
silicon dioxide was deposited on ferrite to form smaller, 
more precise gaps, and most recently the use of thin 
permalloy films and photolithographic techniques has 
permitted extremely small and accurate dimensions for 
head gap, pole tip width, and pole tip thickness. 

Track density improvements have also been aa impor- 
tant part of advances in disk file areal density. These 
improvements have come through fabrication of smaller 
gap widths and more accurate head positioning technol- 
ogy. The resulting advances in areal density are shown in 
Table 1. 
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positioning actuator, L^SSlriS'rair* of magnetic head and its air bearing support, disk substrate and its coating, head- 


Year of first ship 
Product 

Recording density 

Areal density (Mb/in. 2 ) 
Linear bit density (bpi) 
Track density (tpi) 

Key geometric parameters 
(microin.) 

Head-to-disk spacing 
Head gap length 
Medium thickness 

Air hearing <& magnetic element 
Bearing type 
Surace contour 
Slider material 
Core material 

Slider/core bond 


Diameter (in.) 

Substrate thickness (in.) 
Rpm 

Fixed, removable 
Data surfaces/spindle 
Actuator 

Access geometry 
Heads 
Positioning 
Final position 

Actuators/spindle (max. no.) 

Avg. seek time (ms) 

Read 'write electronics 
Data rate (Kbytes/s) 
Encoding 
Detection 
Clocking 

Si.mc :i\ in prcccdinR column. 


1957 
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The evolution of the whole technology is given in the 
overvtew paper ,3,. and the progress in disk file manufac- 
turing and in selected innovations in materials, processes 
and testing ts dtscussed in the paper by Mulvanv and 
Thompson |4| The present paper traces the development 
of each part of the technologies in four key areas: 

• The development of the magnetic head and its air 
bearing support that provides the close spacing be- 
tween the dtsk surface and read/write head necessary 
tor high-dens. ty magnetic recording 
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• The mechanical design aspects of the actuator that 
positions the read/write heads over concentric tracks of 
a rotating disk. 

• The key innovations in logic and electronics required to 
read and write data reliably and accurately from a disk. 


Much of this development has been based upon the 
work of many tndividuals who have created, over the past 
quarter of a century, a technological base that has permit- 
ted the improvement of almost four orders of magnitude 
!" 3real density shown in Ta We I and also enhancements 
m performance, function, and reliability. Two individuals 
m particular deserve mention because their influence was 
so pervas.ve through the early days of development 
They certainly deserve credit for motivation and for ac- 
tive participation in many of the innovations to be dis- 
cussed. They are R. B. Johnson, who had the vision that 
such a device was needed and could be built, and 
L. D. Stevens, who provided the engineering manage- 
ment that realized the first successful product [5], 

Air bearing spacing and magnetic heads 

Ftlm bearings, both self-acting and externally pressur- 
ized. have been in common use for over a hundred years. 


They allow one surface to move relative to another, with 
a film of either a compressible or incompressible fluid 
preventing contact and therefore preventing wear. A 
common example would be the journal bearing in an 
automobile engine. The use of air as the lubricating film 
was first reported by Kingsbury in 1897 [6]. The whole 
concept behind magnetic disk files rests on use of this 
simple mechanism. The innovation was to use such a 
bearing to maintain a small and consistent spacing be- 
tween a magnetic read/write head and a disk surface 
whose axial runout was far greater than the spacing 
required. 

• Hydrostatic (pressurized) air bearings 

The event that provided the basis for the first disk file was 
accomplished on June 2, 1953, when W. A. Goddard [7] 
demonstrated the operation of an air-fed bearing with a 
magnetic element incorporated. This head, shown in Fig. 
1, had small holes drilled around a circumference on its 
face so that air flowing out provided a pressure pad to 
resist the force of a spring-loaded suspension. The mag- 
netic head mounted in the center successfully wrote and 
read bit patterns on a disk, establishing the feasibility of 
maintaining a consistent spacing between a disk surface 
and a read/write head. Of all developments related to the 
disk file, the ability to maintain a consistent spacing (and 
in subsequent generations, to diminish it) has been the 
prime driving force for all other improvements in the 
recording technology. 

N. A. Vogel took the air bearing concept and evolved 
the configuration used in the first production file. It was 
found that a bleed hole in the center of the bearing was 
necessary for stability. The force required to load this 
head onto the disk was provided by three pins acting as 
pistons connected to the same air chamber that supplied 
the holes on the bearing surface. When air pressure was 
applied, the head was forced against the disk with a load 
proportional to the support provided by the bearing. 
Light springs were provided to return the head from the 
disk to the socket when air pressure was removed or in 
the event of a failure in the air supply [8, 9]. 

This head provided a spacing of about 800 microinches 
for a separate write-wide magnetic element and a read- 
narrow magnetic element. This combination provided a 
guard band between recorded tracks to allow for the 
positioning tolerance of the moving-head assembly. 

• Hydrodynamic ( self-acting ) air bearings 

While Vogel’s air head design was improved through 
several generations of development, it became apparent 
that eliminating the disk-to-disk excess motion by placing 
one head element on each disk surface would be a 



Figure 1 First air bearing magnetic head, tested on June 2, 
1953. 


significant functional improvement. Indeed, files having 
several air-fed heads per surface were built for the IBM 
Stretch computer system. The high consumption of com- 
pressed air and the maintenance and reliability associated 
with compressors, however, made this design of limited 
applicability. 

J. J. Hagopian conceived of using a self-acting air 
bearing in such an array of magnetic head carriers. His 
experimentation showed that a reasonably flat bearing 
properly gimbaled and loaded would generate a hydrody- 
namic bearing and provide stable spacing of the magnetic 
element from the rotating disk. Based upon this concept, 
an advanced disk file project was initiated. As develop- 
ment proceeded it became apparent that the operation of 
this bearing was erratic. It would fail, at times, when the 
slider occasionally touched the disk surface. This would 
often lead to catastrophic failure of the disk surface and 
subsequent destruction of the magnetic head in its carrier. 

Classical bearing theory dealt only with an incompress- 
ible fluid. The effect of compressibility on the operation 
of such bearings was the subject of an intensive analytical 
and experimental study conducted in the summer of 1957. 
The analytical Work was done by Dr. W. A. Gross [10] 
and the experimental work was done principally by Dr. 
K. E. Haughton and R. K. Brunner [11]. The practical 
result was that an entry wedge must be provided for the 
entry of the boundary-layer air film. 

Thi s entry wedge was obtai n ed by providing a convexT 
cylindrical surface on the face of the slider, which analy- 
sis and experiment indicated should have a radius of 
curvature of about 250 inches. An innovative way to 
fabricate such a slider was suggested by Brunner. First, 
he accurately distorted concavely the stainless steel slider 
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pivot screws required adjustment and lubrication. Adjust- 
ment was disturbed by temperature variations, and oil 
sometimes found its way to the disk surface. 

This basic suspension design was improved in several 
steps, as seen in Fig. 2(a). The most significant improve- 
ment was made with the design for the IBM 2311. It 
separated the vertical load requirements from the gimbal 
system. The load was applied directly to the slider at the 
center of roll and pitch. The 23 1 1 suspension design used 
a thin fiat flexure ring to control the roll, pitch, and yaw. 
The flexure ring was cut from stainless sheet stock and 
the joints were made by spot welding, eliminating screws 
and lubrication and reducing material, fabrication, and 
assembly cost. Variants of this design were used on all 
subsequent sliders. 

The increased track density of the 23 1 1 [from 50 to 100 
tracks per inch (tpi)] was achieved in part by use of an 
improved head element configuration. This involved a 
change to a "tunnel" erase trimming of the newly written 
track [12] that provided a better signal-to-noise ratio than 
the previous guard band techniques of write-wide read- 
narrow and erase-wide write/read-narrow. Also intro- 
duced on the 231 1 head was a ball staking technique for 
mounting the magnetic element in the slider [13]. This 
technique reduced element movement within the slider 
assembly and maintained the tighter tolerances necessary 
for the higher track density. 

A ferrite magnetic head element was designed for the 
IBM 2314 disk file, first shipped in 1966, because of its 
2.5-megabit/s data rate resulting from increased bit densi- 
ty (1100 to 2200 bpi). Manufacturing process consider- 
ations dictated that these ferrite magnetic elements be 
mounted in a mechanically similar ceramic material. 
Alumina was selected because it could be formed to the 
desired circular geometry and had desirable mechanical 
properties. It provided an improved head/disk interface, 
and ceramic or ferrite materials have been used in all 
subsequent products. The ferrite element was bonded to 
the slider with an epoxy and the face of the assembly 
lapped to the required cylindrical surface. 

As the head-to-disk spacing was reduced from 85 
microinches for the 2314 to 50 microinches for the IBM 
3330, it became critical that the magnetic element gap be 
flush with the face of the slider. In addition, the thermal 
and hydroscopic properties of epoxies used to bond the 
ferrite core to the slider became inadequate. Consequent- 
ly, a compatible ferrite-ceramic-glass material combina- 
tion was developed for the 3330. This required two 
compatible glasses with enough separation in working 
temperatures to allow the separate assembly bonding of 


the parts of the core and the parts of the slider with a high- 
temperature glass. Then the final assembly of the previ- 
ously assembled core and slider was accomplished with a 
second glass at a lower temperature that did not disturb 
the dimensions established in the first bonding [14], A 
new slider ceramic was developed that had a thermal 
expansion coefficient compatible with ferrite and glass, 
and was of sufficient hardness, density, and grain struc- 
ture to provide a satisfactory bearing surface. 


The 3330 slider geometry was changed to rectangular 
and the bleed holes were replaced by a central slot [Fig. 
2(a)]. A spring element in each support arm provided the 
mechanical load for the associated slider [15]. This design 
reduced the tolerances associated with the mechanical 
load and made it possible to set the flying height of each 
slider to close limits. This combination of materials and 
geometry produced a dimensionally stable head/slider 
assembly and made possible a head-to-disk spacing of 50 
microinches for the 3330-1, first shipped in 1971, and 35 


microinches for the 3330-11, first shipped in 1974. Both 
products had a bit density of 4040 bpi. The track density 


of the Model 1 was 192 tpi and that of the Model 11 was 
370 tpi. 

The head/slider combination developed for the IBM 
2305 fixed-head file, a product complementary to the 3330 
and also first shipped in 1971, was based upon the 
concept of a batch manufacturing process developed by 
E. R. Solyst [16]. In this design, nine head elements and a 
flat slider with a taper at the leading edge were machined 
from a single block of ferrite [Fig. 2(b)]. This taper-flat 
design, with its large air bearing surface, required a 
loading force of about 1.2 kilograms to achieve a 50- 
microinch flying height. 

Low-mass, lightly loaded sliders, suspensions, and 
heads 

Winchester taper-flat slider The next major step in 
slider and head development came with the introduction 
of the IBM 3340, first shipped in 1973. This head com- 
bined the taper-flat and batch-fabrication concepts devel- 
oped for the 2305 with the experience gained at the IBM 
Los Gatos Laboratory, where a unique file had been 
developed in the late 1960s using disk technology to 
provide a refresh buffer for a high-resolution display 
system. This fixed-head file, developed by Dr. J. T. Ma, 
was the first to use lubricated iron oxide particulate disks _ ' _ 
and start-and-stop-in-contact heads. To achieve the linear 
density required for that buffer, heads developed by Data 
Disk Corporation for use on a plated disk were used. 

These heads had three small pads arranged in a triangle 

with the magnetic element at the apex providing one of 681 
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FiKure 3 IBM 3370 thin-filn, head: (a) planar view: (b) eenler 
ne cross section showing A-magnetic layers. B-pole tips 
C-conduc.or turns. D-gap layer. E-insulation layer: (c, pole 
tip geometry at the air bearing surface (from Ref. 21). 


the pads. They provided a small air bearing area and a low 
mass, were lightly loaded with about 20 grams by a 
flexure suspension, and were in contact with the disk 
surface when it was stopped. In operation they main- 
tained a nominal spacing of 20 microinches. A lubricated 
particulate disk was developed by IBM to minimize wear 
during the periods of in-contact operation. IBM obtained 
rights to produce this tri-pad head design and a new 
version was developed by W. S. Buslik for use in a disk 
dnve embedded in the IBM 3735 Programmable Buffered 
Terminal. 

Experience had shown that this tri-pad design was 
difficult and costly to manufacture, and in 1969 a develop- 
ment effort was initiated to design a head that would 
operate at close spacing but could be produced in quanti- 
ty with a simple process. The resulting tri-rail design by 
Warner [17] is known as the "Winchester” head. A small 
taper-flat bearing area, such as in the 2305, was provided 
by the outer two rails of the tri-rail design. The center rail 
defined the width of the magnetic element at the trailing 
edge where a ferrite core was formed, as in the 2305 
design. This head maintained a spacing of about 20 
micromches with a load of less than 20 grams. Mulvany 
L ] gives a more detailed discussion of the evolution of 


the Winchester head/slider for the interested reader. The 
elegant simplicity of this design has given it industry-wide 
acceptance. 

FUm m “f! ne,i c dements Based upon the prior develop- 
ment work of plated magnetic memories, work was begun 
on film recording heads in the late 1960s. Romankiw 
Croll, and Hatzakis [19] built the first operating thin-film 
head in IBM. The success of these early designs resulted 
m a development activity to integrate a film magnetic 
element with a slider bearing in a configuration suitable 
for product use. Additional contributions by Thompson 
and Romankiw [20], Jones [21], and others led to the 
design of the head that was first shipped in the IBM 3370 
in 1979. The advantages of this structure, shown in Fig 3 
are the precise control of the gap, the inherently thin pole 
tips which enhance the sharpness of the readback pulse, 
and the excellent frequency response of the element! 
These combine to give a unique potential for enhance- 
ment of disk file capacity and data rate. 

Design aspects of disk file actuators 

The key mechanical components of a moving-head disk 
file are in the actuator and carriage assembly [22]. Its role 
is to transport the read/write heads from one track 
position to another as fast and as accurately as possible. 
Speed is important because it directly affects seek time, 
which is a major component of access time. Positioning 
accuracy is important because it affects the maximum 
attainable track density. In this section, the progress of 
actuator and carriage development will be reviewed with 
emphasis on the improvements made in both seek time 
and positioning accuracy. 

• The IBM 350 actuator 

The IBM 350 disk file [2] provides a reference to measure 
progress. A schematic of the actuator and carriage of the 
350 ,s shown in Fig. 4. Two pressurized air-bearing- 
supported magnetic heads, gimbaled to a pair of arms, 
were mounted on a carriage that was moved vertically to 
one of 50 disks 24 inches in diameter and spaced 0.4 inch 
apart, by a cable connected through two counter-rotating 
magnetic powder clutches driven by a single motor. A 
null servo system and then a mechanical detent were used 
to position the carriage at the selected disk. Detenting the 
carnage changed the mode of the actuator, permitting the 
same drive system and a similar null servo to move the 
two head-arm assemblies to one of 100 concentric tracks 
in a five-inch radial band. Upon arrival at the selected 
track, air pressure was applied to detent the arm into its 
final position and load the air bearing heads against the 
isk surface preparatory to reading or writing. The de- 
tenting system provided a track density of 20 tpi. The 
average seek time for this two-dimensional actuator was 
600 milliseconds. 
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Hydraulic actuators and comb access structures 

The next 24-inch-diameter disk file, the IBM 1301, intro- 
duced the self-acting air bearing slider in place of the 
externally pressurized bearing. This made the develop- 
ment of a comb-type carriage practical, with one self- 
acting slider for each disk surface. A new hydraulic 
actuator was developed to increase the access rate, and 
the comb of heads provided a logical cylinder of data 
tracks at each access position. This actuator employed 
two sets of hydraulic cylinders, one each for coarse and 
fine positioning, and was capable of moving the carriage 
with its 50 head-arm assemblies from one logical data 
cylinder to any other with an average seek time of 165 
milliseconds. The accuracy of the hydraulic system with 
detents permitted an initial track density of 50 tpi, later 
increased to 100 tpi. 

Removable disk pack 

When it was seen that needs of small systems users could 
not be met at an acceptable cost using the large compo- 
nents described, a proposal was made for a small disk file, 
utilizing a removable disk assembly to allow off-line 
storage and backup [23]. Consequently, a disk drive was 
developed [24], using six 14-inch-diameter disks mounted 
in a customer-interchangeable disk pack enclosed in a 
protective cover that could be removed and replaced only 
when the pack was in place on the drive hub [25, 26]. The 
IBM 1311 was the first of a family of files with inter- 
changeable disk packs. This design concept was extended 
into the early 1970s. 



Figure 4 IBM 350 carriage and actuator (from Ref. 2). 


The design of an interchangeable disk pack required 
tight control of the absolute position and the position 
tolerances of the data tracks written on each pack. 
Control of the absolute position of the tracks was estab- 
lished by introducing a special disk pack for use by both 
manufacturing and field engineering. A cylinder of tracks 
was very precisely written on this pack using bit patterns 
chosen for ease in manually adjusting data heads. This 
special pack permitted all of the heads on all of the drives 
to be initially aligned to the same absolute cylinder 
location. The location of all of the other cylinders on the 
disk pack was then a function of the carriage, disk pack, 
temperature, and spindle tolerances of the disk file that 
wrote data on these cylinders. 

Table 2 lists the key tolerances affecting track position- 
ing accuracy for several file systems with interchangeable 
magnetic media. The track positioning accuracy can be 
determined by combining these tolerances statistically. 
The result is referred to as the "track misregistration 
tolerance." Track density in all disk files is determined by 
the condition that the head, disk, and recording channel 
off-track capability be greater than or equal to the track 
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The IBM 3330 was the first production disk file to 
incorporate a track-following feedback control system. 
Feasibility of a similar positioning concept was demon- 
strated by Hoagland in the early 1960s [27], The 3330 
feedback system consisted of 

• One dedicated disk surface with special prerecorded 
tracks defining the position of each data cylinder. 

• A servo head to read the prerecorded tracks. 

• An electronic control system that generated a position 
error signal and translated it into actuator drive current. 

• An actuator motor that generated force directly propor- 
tional to drive current. 



Figure 5 IBM 3330 voice coil actuator. 


assembly. An innovative hydraulic actuator was devel- 
oped that took advantage of the shorter stroke length and 
reduced carriage mass. The IBM 1311 actuator drove the 
carriage selectively at one of two speeds determined by 
fluid flow rates. A high flow rate (and high speed) was 
used for traveling long distances, and a low flow rate (and 
low speed) was selected just prior to arriving at the 
selected data cylinder. At the low speed, a mechanical 
detent on the carriage could be engaged without creating 
excessive carriage vibration and overshoot. For short 
distances, only the low speed was employed. Hydraulic 
fluid was delivered to this two-speed actuator by a small 
commercially available gear pump. This combination 
resulted in a low-cost carriage and actuator assembly with 
an average seek time of 150 milliseconds. 

The basic actuator and carriage concept of the 1311 was 
also employed on successor products, the 2311 Disk 
Drive and the 2314 File Facility. Carriage tolerance 
modifications achieved the more stringent track density 
requirements of 100 tpi. With the change from a two- 
speed to a three-speed actuator the average seek time was 
reduced from 150 milliseconds to 60 milliseconds. 

• Truck-following servo actiuitors 
The 131 1 tolerances listed in Table 2 show that the radial 
positioning accuracy of the carriage and the temperature 
effects contribute almost 40% of the total. This was so 
because the radial positioning detent was referenced to 
the main drive casting rather than a track on the disk 
pack. There was no compensation for even a portion of 
the nonuniform and time-varying effects of temperature. 
By moving the reference from the base casting to the disk 
pack and providing a feedback loop to control the car- 
riage position, a portion of the tolerance due to tempera- 
ture could be eliminated. 


The actuator was made of a wire-wound cylindrical 
bobbin supported by a carriage (Fig. 5) and moving in a 
magnetic field structure similar to a loudspeaker (hence 
the term voice coil motor*’). During seeking, current in 
the bobbin was controlled by monitoring servo track 
crossings and resulted in heads arriving at the selected 
cylinder with very little overshoot. Feedback control was 
continuously provided in the track following mode, per 
mining the servo head to follow the radial runout and 


temperature effects. The signal pattern recorded on the 
servo tracks [28] was a key factor in attaining accurate 
track following at a reasonable cost. 


Table 2. The net effect of all tolerance improvemet 


resulted in attaining 192 tpi. The servo control system 
with the voice coil motor resulted in an average seek time 
of 30 milliseconds. 


With the development of the 3330, the 13 1 1 family had 
grown from a single spindle disk file of 2.68 million bytes 
to a dual spindle file of 100 million bytes per spindle. No 
longer was this file family designed for the first-time file 
user but rather for the experienced disk file system user, 
leaving a need for a smaller-capacity and lower-cost 
entry-level disk file. To meet this need, the IBM 3340 disk 
file was developed. 

The IBM 3340 actuator and data module 

The IBM 3340 contained many innovative designs [18]. 
Key was the low-mass, lightly loaded "Winchester" head 
and lubricated disks which permitted heads to remain on 
the disk while it was starting and stopping, thus eliminat- 
ing the load/unload mechanism. Because of the low cost 
of the head/slider, two heads per surface could be used. 
This cut in half the stroke length and lowered the cost of 
both the carriage and. the actuator. The 70-megabyte 
capacity was obtained with four disks. The disks, the disk 
spindle and bearings, the carriage, and the head-arm 
assemblies were incorporated into one removable pack- 
age called a Data Module [18]. Since each head need only 
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read the data that it had written— a concept that had been 
suggested earlier by Buslik [29]— the need for a special 
disk pack and for adjusting the individual data heads was 
eliminated. The positioning accuracy tolerances are also 
shown in Table 2. The IBM 3348 Data Module attained a 
track density of 300 tpi and an access time of 25 millisec- 
onds. 

The IBM 3350, first shipped in 1976, extended the 
Winchester technology by increasing the number of disks 
per drive, the bit density to 6425 bpi, the track density to 
478 tpi, and the data rate to 1.2 Mbytes/s, resulting in a 

4.5 times increase in capacity per spindle to 

317.5 Mbytes. One of the key changes that permitted 
these improvements was the decision not to provide a 
customer-removable data module. Tolerances associated 
with removability were eliminated, allowing an increase 
in track density. Eliminating removability reduced still 
further the potential for particulate contamination due to 
handling of the data module. 

Rotary actuators 

An ingenious actuator [30] developed at the IBM Devel- 
opment Laboratory in Hursley, England, involved a 
moving-coil rotary actuator. Fig. 6, with pivoting data 
arms. Although the pivoting data arm concept was not 
new, this rotary actuator and carriage, with only one 
moving part, provided most of the advantages of the 
linear voice coil actuator [31], but at very low product 
cost. As a result this actuator using the Winchester head 
technology, combined with a nonremovable disk in a 
sealed enclosure, has been used in imbedded disk files for 
several small systems such as the IBM System/32. The 
heat generated by this small unit was low enough to be 
dissipated directly through the enclosure, without requir- 
ing external air cooling, thus providing low-cost protec- 
tion from external contamination. 

The latest small IBM disk drive developed at the 
Hursley Laboratory, the IBM 3310, also utilizes a rotary 
actuator and carriage with a dedicated servo surface. In 
addition, it employs a sectorized track-following servo 
concept [32, 33]. Each data surface has prerecorded 
feedback information located in small radial sectors 
equally distributed around the disk. During seek opera- 
tions the dedicated servo surface is used to provide 
feedback information, but when the desired cylinder is 
reached, position information from the selected data head 
is combined with the information from the dedicated 
servo surface to control the carriage position. The control 
electronics switches data and servo information to appro- 
priate circuits and samples and holds servo information 
between sectors. This sector servo control virtually elimi- 
nates temperature effects and most of the tolerances 
between the dedicated servo head and the data head. 


Substrates In early disk file development the major 
concern with the substrate was its axial runout. The first 
disk file used an aluminum substrate of “recording 
stock” quality and disks were fabricated by laminating 
two half-disks, 0.051 inch thick, together with an adhe- 
sive. The coating on these disks was relatively thick, 
1200 microinches, masking the effects of substrate imper- 
fections. 


The requirements for higher-density recording demand 
that the head be closer to the surface and the magnetic 
coating thickness be reduced. As the thickness of the 
magnetic coating and the flying height of the head/slider 
decreased, the effects of the substrate became more 
important. Various polishing, grinding, and machining 
techniques were developed to improve flatness and 
smoothness. A key advance in the fabrication of sub- 
strates was the development of a diamond tool-lathe 
process in conjunction with the use of a much purer 
aluminum alloy [4]. This process, developed by the 
Manufacturing Group in Sindelfingen, Germany, estab- 
lished a technology base that has provided substrates for 
all subsequent generations of IBM disk products. 

New and better measurement techniques were devel- 
oped. The measurements of the first and second deriva- 
tives of runout were important in predicting the excursion 
of the head over local surface imperfections. With the 
introduction of lower flying heights and increasing rota- 
tional speed of the disk, the resonant frequencies of the 
substrate became important. Rotational speed and sub- 
strate thickness were selected to avoid these frequencies. 

Magnetic coating The magnetic coating is a dispersion 
of magnetic particles in a binder. In the early 1950s the 
magnetic particles were limited to the gamma form of iron 685 
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Figure 7 Plot of time required to orient magnetic panicles 
during coating a disk, r.v the fluid viscosity of the magnetic 
medium. 


oxide, and polymer chemistry was in its infancy, provid- 
ing few choices of polymeric materials to satisfy the 
rigorous demands for a disk recording medium. The key 
demands on a binder material are 

• Ready dispersion of magnetic iron oxide particles. 

• Viscosity low enough to spin-coat on an aluminum 
substrate. 

• Curing to form a crosslinked. hard, durable material. 

• Good adhesion to the substrate. 

• Smooth surface polish with few pinholes or other 
imperfections. 

Literally thousands of combinations were tried through 
much empirical study and diligent experimentation. The 
work culminated in a unique binder system described by 
Johnson. Flores, and Vogel [34], The choice of a combi- 
nation of an epoxy, a phenolic, and a polyvinyl methyl 
ether proved to be an excellent selection and with modifi- 
cation is still the basic formulation in use today [4], 
Similarly, the original spin coating used for the first disks 
continues to be the predominate process for most disk 
production today. 

The magnetic properties of commercially available 
magnetic particles have continued to improve over the 
years. Good control in particle geometry yields materials 
that facilitate particle orientation within the recording 
medium. Structural defects within particles have also 


been improved by minimizing pores and nonmagnetic 
inclusions between the individual crystallites of a parti- 
cle, thus leading to better control over the range of 
particle-switching fields. 

Magnetic particle orientation Orientation of the oxide 
particles as routinely practiced in magnetic tape had long 
been an objective of disk development. The geometry of a 
disk and the requirements of performing the orientation 
on the spin coaler proved a difficult hurdle for many 
years. E. M. Williams found that a new iron oxide particle 
from Pfizer had high acicularity and a narrow switching 
field distribution. This oxide and the concurrent work on 
orienting particles on disks by P. T. Chang, A. W. Ward, 
and W. N. Johnson led to a new disk product with 
substantial improvement in recording properties. The 
oriented disk was first introduced in the 3340 Data 
Module and has been used in all subsequent disk file 
products. 

The key parameters for orientation are the fluid viscosi- 
ty of the media, the time required to orient the particles, 
and the magnetic field applied to the particles. It is 
necessary to carefully control both the spinoff conditions 
and duration as well as the application viscosity of the 
fluid, as indicated in Fig. 7. Because the orienting field is 
from the gap fringe field of a magnet, both the strength of 
the field and the time the particles "see" that field 
become the critical factors for good orientation. With a 
pair of opposing magnets on each side of the disk so that 
the fields are essentially in the plane of the disk, it was 
found that a gap field of 1800-2000 oersteds [I Oe = 

( 1000/4tt) A-rrT'] was required. If the field was weaker 
than that, there was insufficient force to rotate the 
particles in the viscous medium. If the field was much 
bigger than that, the particles simply switched their 
magnetization polarity rather than being rotated. It was 
also necessary to rotate the disk slowly for maximum 
orientation. If the disk were rotated too rapidly, the 
particles switched their magnetization polarity, and if too 
slowly, it caused coating imperfections. 

Lubrication The development of the low-load Winches- 
ter slider, designed to be in contact with the disk surface 
during starting and stopping, put new requirements on the 
surface characteristics of the magnetic medium. These 
requirements were met by the development of lubricating 
techniques for disks similar to those previously used on 
magnetic tape. The first lubricant used a 0.75% solution 
of Dow Corning RDC-20O in a freon solvent, and was 
applied topographically to the disk with a soft absorbent 
cloth. This disk proved to be very reliable and led to the 
development of disk file products utilizing the concept of 
a lubricated surface with start-and-stop-in-contact heads. 
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In succeeding years, numerous other lubricants were 
evaluated to meet the demanding needs of the newer high- 
performance files. 

Data encoding and channel electronics 

Advances in encoding methods and recording electronics 
have also contributed to increasing disk file areal densi- 
ties. As linear bit densities have increased from 100 bpi to 
15 000 bpi and rotational velocity from 1200 to 3600 rpm, 
the serial data rate at the read/write head has increased 
from 100 kilobits/s to 24 megabits/s. These data rates 
have, in recent generations, pressed the data rate capabil- 
ity of host system data channels, and in addition have 
provided a challenge in the design of disk file electronics. 

For writing data, the electronics converts logic-level 
data signals into a current that drives the recording head. 
Later, when data are being read, the process must be 
reversed. Analog signals induced in the coils of the read 
head must be amplified, detected, and clocked. Reliable 
recovery of stored information involves both time detec- 
tion and amplitude detection. Encoding methods have 
been designed to place more emphasis on either one or 
the other of these two dimensions of detection, depending 
upon the disk and head magnetic and electrical character- 
istics. 

The first disk file utilized NRZI encoding [35], devel- 
oped initially for tape drives. Data were recorded by 
changing the direction of current flow in the head for 
every 1-bit, with no change in current direction for aO-bit. 
Sufficient current was applied to saturate the medium. 
Since the readback signal is proportional to the rate of 
change of recorded field, a pulse was read back for every 
1-bit. Presence of the signal was detected when the head 
voltage exceeded a predetermined clipping level. This 
amplitude detection was capable of discriminating against 
background noise, base-line overshoot, and extra bit 
defects. The data-clocking circuits, developed by Seader 
[36], used two fast-starting oscillators, each started by 
alternate 1-bits. After starting, the cycles of one oscillator 
were used to clock any following 0-bits. The second 
oscillator was started when the next 1-bit was detected. 
The use of an odd redundancy bit and a space bit at the 
end of each character limited the free-running cycles of an 
oscillator to at most one character. 

The next disk file used a combination of techniques for 
developing a clock. A separate clock track provided a 
stream of periodic pulses with the rotation of the disk. 
Each bit time was divided into four phases by delaying 
the base clock three times. At the beginning of each byte 
or character of data, one of the four phases was selected 
to clock the remaining data bits. Since the clock track 


signal was locked to minor disk speed variations, and 
with the time reference re-established at the beginning of 
each byte, the expected time variation between a clock 
phase and data from a head was small. Furthermore, 
mechanical variations due to vibration and temperature 
differences between the clock head and data head, which 
could cause clocking errors, were low in frequency 
relative to the data stream. The major drawback of this 
system was the cost of a clock head and read amplifier 
together with logic to perform the phase selection. The 
amplitude detection system was improved by the addition 
of automatic gain control to eliminate the wide range of 
signal amplitude variation caused by different head and 
disk configurations. As bit densities increased, so did 
data rates, and it became more important to generate a 
clock that was synchronized with the data to provide 
additional clocking tolerance. 

The variable-frequency oscillator (VFO) was a circuit 
which offered the potential of reducing many tolerances 
experienced by previously described circuits. The VFO 
had been developed by Newman [37] for tape but was 
redesigned for the high data rate of disk files. The circuit 
was an oscillator whose frequency could be electronically 
adjusted with a feedback loop. Recorded pulse rates were 
compared with an internal oscillator. If the oscillator was 
operating at too low a frequency, an error signal was 
generated to increase the oscillator frequency. The circuit 
eventually locked onto the periodicity of data pulses, but 
lagged a slight amount to generate a continuous error 
signal. An improvement of this circuit by Lang, LaPine, 
and Vaughn [38] integrated the error signal and drove the 
phase as well as the frequency error to zero. The averag- 
ing properties of this circuit reduced to almost zero those 
errors caused by setting the clocking window. The pri- 
mary tolerance of clocking was now associated with the 
data bit rather than the location of the clocking window. 
Initially, double-frequency encoding was used, providing 
a clock pulse with every bit cell as a point of reference. 

The clocking accuracy of a VFO operating at high 
frequencies permitted utilization of an encoding method 
with reduced transitions but required two clock cycles 
per bit cell. The structure of the code made it slightly 
more susceptible to noise, and a new detection system 
known as the delta-V detector was developed by Graham, 
Scovmand, and Swartz [39] to maintain and improve 
reliability. The delta-V detector used a series of tests to 
improve the signal rejection capability over simple ampli- 
tude discrimination. A minimum leading edge slope was 
required, and the signal peak was detected by finding the 
derivative crossover point, and a minimum trailing slope 
was sensed. Finally, the amplitude of the signal associat- 
ed with the pulse had to exceed a minimum level. 
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Through the use of a delay line and logic, these four 
conditions were tested, and if they were satisfied, an 
output pulse was generated. 

More recently. Franaszek-type run-length-limited 
codes [40] have been implemented by Eggenberger and 
Hodges [41]. These codes limit the minimum and maxi- 
mum distance between transitions. For example, the 2.7 
code has a minimum of two zeros and a maximum of 
seven zeros between transitions. At the same time they 
use the transitions with such efficiency that on the 
average 1.5 bits per transition is achieved. A new method 
of detection was developed by D. B. Chapman for use on 
recent IBM disk products with such a 2.7 run-length- 
limited code. The signal is tested to see that the polarity 
of the pulses alternates and that a minimum amplitude is 
achieved. The signal peak is found through differentia- 
tion. and finally, a minimum signal change, occurring 
after the peak, must occur within a specific period of 
time. These four criteria are logically brought together to 
make a detection decision. 

As track densities increased, the readback signal ampli- 
tude was reduced. Furthermore, increasing data rates led 
to introduction of an electronics module on each access 
arm. This module provided the write current as well as 
amplification for the read signal. Since every arm assem- 
bly was designed to use one module containing a total of 
150 high-speed bipolar circuits, low cost was important to 
make this system practical. 

Summary 

When the first disk file was conceived in IBM's newly 
established development laboratory in California, no one 
visualized how far the technology would be extended. 
The first disk file introduced the externally pressurized 
air bearing as a support for the magnetic head, the 24-inch 
disk, the coating technology, and a two-dimensional 
access mechanism. The starting point of areal density was 
100 bpi linearly and 20 tpi radially, or 2000 bits per square 
inch. 

The next generation introduced the use of self-acting 
bearings as a head carrier and the high-performance comb 
actuator. Areal density was increased to 100 000 bits per 
square inch (100 tpi and 1000 bpi), a factor of 50 improve- 
ment. This file, part of the IBM Sabre system for airline 
reservations, was key to the start of interactive process- 
ing as it is known today. 

The disk pack files extended usage to low-end systems. 

It made the disk drive the primary storage for such 
systems. The interchangeable pack permitted it to serve 


both tape and disk functions. A number of the small 
systems were supported by disks exclusively. 

Further increases in capacity resulted from continuing 
work in mechanics, electronics, materials, and processes, 
which further improved storage density. This continuing 
evolution lowered the cost of on-line disk storage to a 
point where it became the work storage for all data 
processing systems, large and small alike, and permitted 
the expansion of on-line applications in the 1960s and 
early 1970s. In addition to substantial improvements in 
heads, disks, and recording electronics, the introduction 
of the track-following servo improved both performance 
and track density significantly. 

Introduction of the Data Module drive with heads in 
contact with the disk as it was started and stopped 
provided a basis for yet another step increase of density. 

With the introduction of the IBM 3350, a full circle of 
disk file concepts had been made. Nonremovable data 
disks had first been introduced in the IBM 350 design. 
Next came the "unlimited capacity" family with custom- 
er-removable disk packs or data modules. As the storage 
technology improved over the years, a removable disk 
file product which held two million bytes in 1963 could, in 
1974, hold 200 million bytes, greatly reducing the custom- 
er needs for off-line storage. With the significant in- 
creases in reliability and availability made during the 
same period, users no longer had a need for disk inter- 
changeability to meet data availability requirements. 
Therefore, the IBM 3350, with over 300 million bytes per 
spindle, was the first file of the 1970s to have nonremov- 
able disks. That this way of reducing tolerances led to 
further increases in density is witnessed by the fact that 
the most recently announced disk file product, the IBM 
3380, shipped in 1981, is also a fixed-disk product with 
1250 million bytes of storage per spindle— a vast improve- 
ment over the five million characters first introduced a 
quarter century earlier. 
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The introduction of new semiconductor products does 
not end with the successful operation of an engineering 
prototype. When these products reach the manufacturing 
stage, new problems must be solved in assimilating new 
and more complex technologies. The rapid progress in 
semiconductor technology has required that manufac- 
turing be highly adaptable, resulting in significant devel- 
opments in yield management, testing, production con- 
trol. and logistical systems. This has led to the most 
highly developed and technologically complex manufac- 
turing facilities of any industry, one in which the use of 
the computer has become indispensible. A paper by 
Harding on semiconductor manufacturing, which follows 
the extensive development of manufacturing technology 
through IBM's major logic products and its influence on 
component design, concludes this chapter. 


From the very early computer products, the increasing 
complexity of their electronic components necessitated 
the development of a computer data base to document 
and support engineering design. Phis spawned the new 
discipline of design automation, w hich uses computers to 
design future generations of computers. Design automa- 
tion has become sine qua non for the design and manutac- 
ture of electronic computer components: The develop- 
ments described in the first three papers of this chapter 
would have been impossible without it. and it is today one 
of the key limiting factors for LSI development. The 
paper by Case cl nl. describes the evolution of IBM's 
design automation systems from their early beginnings. 
Under the constraint of dealing with such a broad subject, 
the authors chose to treat three major areas: design 
verification, test generation, and physical design. While 
there is some treatment of custom design, many contribu- 
tions. especially efforts not in the centralized design 
automation activity, could not be covered. 
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Solid State Memory Development in IBM 


technology i" IBM processors lias undergone a 280 000-fold increase in density, a 20 000 times decrease in 
no.verper hit. and a 10 to 100 times increase in speed during the last twenty-five years. These improvements have brought 
mans advantages to users of information processing equipment, including a 650-fold reduction in the cost per bit of 
memory. Daring this period, processor memory technology evolved from cathode ray storage tubes in the early 1950s, 
through ferrite cores and thin magnetic films in the 1950s and 1960s, to bipolar and M OSFET semiconductor memories in 
the late 1960s through the 1970s. This paper describes these developments and the technical innovations that made them 
possible. It also describes continuing exploratory efforts, including work on magnetic bubbles— the newest solid state 
memory technology. 


Introduction 

This paper describes the rapid progress in memory tech- 
nology which occurred during the last quarter-century 
and highlights the technical contributions of people in 
IBM who helped to make it possible. 

The earliest device used for main memory in electronic 
calculating machines was a mercury sonic delay line 
developed in the late 1940s. This serially addressed 
memory, in which the contents were continuously circu- 
lated. required several milliseconds to read or write 
information. This was slower than desired, even for the 
early electronic calculators. As electronic calculators 
evolved toward stored-program computers in the late 
1940s and early 1950s. a substantial improvement in the 
•need, reliability, and cost of memory technologies was 
essential because an increase in the speed of a stored- 
program computer required nearly equal increases in both 
the speed and capacity of the main memory. 

A cathode-ray-tube (CRT) memory, invented by Wil- 
liams [1] at the University of Manchester, England, 
provided much faster access to stored information. It was 


termed a random access memory (RAM) because access 
time was independent of the memory address. The time 
required to write or read information was about ten ps. 
CRT memories were used in early stored-program com- 
puters such as the IBM 701, first shipped in 1953. 

The use of magnetic cores for a RAM in computers was 
independently invented about 1950 by Forrester at MIT 
[2] and Rajchman at RCA [3], with Forrester eventually 
being recognized as the earlier inventor [4]. The first main 
memory using ferrite cores wired in a three-dimensional 
array as proposed by Forrester was employed in MIT's 
Whirlwind computer in 1953. 

At the same time, MIT and IBM cooperatively devel- 
oped a core memory for SAGE (Semi-Automatic Ground 
Environment), a computer IBM manufactured for the 
U.S. Air Force. IBM also independently designed and 
manufactured core memories for its 704 and 705 comput- 
ers, which were first delivered in December 1955 and 
January 1956, respectively. Whirlwind, SAGE, the ERA 
1 103, and the IBM 704 and 705 employed the first ferrite 
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Table 1 Representative IBM processor memory technologies. 
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jo re memories capable of handling the large quantity of 
instructions .md data required by stored-program com- 
puters. 

Ferrite core memories were soon proven to be faster, 
cheaper, and more reliable than earlier technologies. By 
the time the first IBM Sworn 360 computers were an- 
nounced. IBM had developed advanced memory designs 
and automated manufacturing techniques capable of pro- 
ducing large quantities of memories at a very low cost. 

Numerous research and development projects were 
undertaken by IBM during the l%0s to identify and 
develop promising new technologies. Work on magnetic 
film technology, for example, resulted in the shipment of 
memories on the IBM System 360 Model 95 computer in 
1968 which were five to ten times faster than ferrite core 
memories. However, development work on ferrite core 
memories was stopped by IBM early in 1968. and on 
magnetic film memories in 1969. to permit greater focus 
on semiconductors. Management had gained confidence 
in semiconductor memories following the innovative use 
of a small, high-performance monolithic semiconductor 
memory as an automatic butfer (cache) for a large, low- 
cost ferrite core memory on the IBM System/360 Model 
85 processor. 

With its memory development resources focused on 
semiconductors. IBM was able to deliver the world's first 
commercial computer with an all-semiconductor main 
memory in June 1971. the IBM System/370 Model 145. 
This multi-megabit memory was composed of bipolar 
semiconductor chips less than one-eighth of an inch 
(3.1 mm) on a side, each with 128 bits. In April 1973 the 
IBM System/370 Model 158 computer was shipped with 


the first main memory to use the lower-cost MOSFET 
devices. The first version had 1024 bits per chip and 2048 
bits per module. The n-channel technology pioneered by 
IBM in these devices provided 50 ns access at the chip 
and 150 ns access to a twenty-million-bit memory system. 
Within a few years, monolithic semiconductor memory 
technology had become standard in the industry. In 1979 
IBM began shipping a new semiconductor memory prod- 
uct with over 65 000 bits per chip— the largest number of 
bits per chip achieved in any system to that time. 

A number of IBM processor memory product technolo- 
gies are listed in Table I and serve to illustrate the 
dramatic improvements which occurred during this peri- 
od. Among the many important advantages for the user 
was a reduction in monthly rental per bit of memory from 
3.5tf for the CRT on the IBM 701 to 0.0054c for the 
MOSFET technology on the IBM 4331. 

Ferrite cores 

Information in ferrite core memories is stored by the 
clockwise or counterclockwise direction of the magneti- 
zation around the hole of each donut-shaped ferrite core. 
These two magnetization states represent the logical “1” 
and "0" of digital computer information and can be 
switched by the coincidence of the currents in wires 
through the hole in the donut-shaped core. In three- 
dimensional (3D) arrays, cores are to switch only under 
the influence of positive polarity current pulses on the X 
and Y wires and the absence of a negative (inhibit) pulse 
on the Z wire during a write operation, and under the 
influence of negative current pulses on the X and Y wires 
during a read operation [2]. To operate reliably in this 
mode, the cores must switch when acted on by this 
combination of current pulses and must remain undis- 
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turned when acted upon by a large number of current 
pulses of any other combination. Cores which satisfy this 
requirement are said to have square loops because a plot 

their magnetization on the y axis and applied current 
pulses on the x axis has a square or rectangular shape. In 
early memory designs, a fourth (sense) wire through each 
core was used to detect the output voltage induced when 
a core was switched. Later designs used one of the drive 
wires for this function. 

» Initial developments 

Early successful cores used the magnesium-manganese 
Mg-Mn) ferrite material invented about 1950 by Albers- 
-jhoenberg of General Ceramics [5] or the copper-man- 
.canese (Cu-Mn) ferrite material invented by Beck and 
Matteson of Philips [6]. An effort directed by J. W. 
Gibson of IBM was successful in producing Mg-Mn cores 
for the SAGE, 704. and 705 memories. However, because 
of difficulties in acquiring a second source for cores of 
this composition. IBM began a cooperative effort with 
Philips on cores of the Cu-Mn composition. This compo- 
rition was used in 98.7% of all the cores produced for 
IBM memories. 

A major improvement in the quality and uniformity of 
cores was achieved by Brownlow of IBM in 1956 [7]. He 
achieved a specific oxidation state by rapidly quenching 
the cores from an elevated temperature to room tempera- 
ture during the sintering cycle. Concurrently, Gibson and 
Schuenzel of IBM and Bleyenberg, DeRooy, and Dam of 
Philips discovered that presintering of the material prior 
.0 pressing plus a two-step sintering technique in which 
'he cores were first sintered at a high temperature and 
:hen at a temperature a few hundred degrees lower 
significantly improved the characteristics of Cu-Mn cores 
[ 8 ]. 

IBM and MIT jointly designed rotary press tooling to 
form the donut-shaped toroidal cores for the SAGE 
computer. In 1954 these machines were able to form 
cores at a rate of 30 per minute. This rate was increased to 
6144 cores per minute by 1966 as the result of an 
evolutionary development of rotary press designs by 
IBM. During the same period, the outside diameter of 
cores used in products was reduced from 80 to 21 mil 
(thousandths of an inch; i.e., 2 to 0.5 mm) in order to 
reduce drive current and power requirements. 

The first of IBM's core planes were fabricated by 
soldering the copper wire to a steel needle which was then 
manually probed into a dish filled with cores to place 
mem on the wire. These wires, loaded with cores, were 
soldered to terminals on a frame, and the other three 
groups of wires were hand-threaded. Then persons at 


MIT conceived a technique that automatically positioned 
the cores for wiring by air flow through holes in the 
bottom of a matrix of cavities. Shaw and Link of IBM 
invented the technique of using hollow steel needles 
which were strong enough to be simultaneously threaded 
through 64 columns or rows of cores in the matrix [9]. The 
weaker copper wires were inserted through the hollow 
needles; when their ends protruded, they were grasped 
and the needles were withdrawn. IBM implemented this 
wiring concept in 1955; it reduced the time to thread the X 
and Y wires in a 64 x 64 plane from 25 hours to 12 
minutes. Shaw and Judge of IBM devised a mechanism 
which grasped the X and Y wires after they were threaded 
and wrapped them around terminal posts on the plane 
frames [10]. The time for this operation was thus reduced 
from 2.5 hours to half a minute. 

The generation of X, Y, and Z current pulses was a 
major challenge; currents of 1/4 to 3/4 A with rise times 
on the order of 0. 1 /as were required. Rajchman of RCA 
invented a two-dimensional matrix of transformers made 
with rectangular hysteresis loop cores which reduced the 
number of vacuum tubes for generating the X and Y 
currents by a ratio of 2 \/n/n, where n is the number of X 
or Y windings [11]. An improvement of Rajchman’s 
concept invented by Bloch of IBM reduced the magnitude 
of spurious output currents [12]. Bloch’s switch-core- 
drive matrix concept was used in many memories, includ- 
ing the 12-/AS-cycle, 4096 x 36-bit memory shipped on the 
IBM 704 in December 1955 [13]. 

Of the early IBM core memories for commercial com- 
puters, the fastest was for the 705 computer; it had a cycle 
time of 9 /as and a capacity of 4000 x 35 bit words [14]. 
The noncommercial 4096 x 33-bit memory completed in 
1955 for the SAGE system had a cycle time of only 6 /as. 
All were made with cores of 80 mil outside diameter (OD) 
and 50 mil inside diameter (ID) (2 mm OD and 1.3 mm 
ID). 

• The next generations 

In 1958 IBM announced two new memory products: the 
7301 , for the 7070 processor, with a maximum capacity of 
9990 53-bit words and a cycle time of 6 /as (later reduced 
to 4 /as), and the 7302, for the 7030 (Stretch), 7080, and 
7090 processors, with 16 384 72-bit words and a cycle 
time of 2.18 /as [15]. The core ID-OD was reduced from 
50-80 mils (1.3-2 mm) to 30-50 mils (0.8- 1.3 mm), and 
magnetic characteristics were changed to accomplish 
faster switching speed with approximately the same drive 
current. With the increase in speed, the drive and sense 
windings had to be terminated with resistors equal to their 
characteristic impedance of 100 to 200 Cl. The termina- 
tion in characteristic impedance was needed in order to 
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Fixture 1 Photograph shoeing the edge of a 7302 ferrite core 
plane with 19-32-mil (0.48-0.8 1 -mrru cores. This had a2-/xs cycle 
and was used on the IBM 7094 computer. 


eliminate pulse-shape distortion. This increased the X. Y. 
and Z drive-pulse peak power by about a factor of ten. 
and the rise and fall times of the drive pulses had to be 
reduced. To satisfy these requirements. IBM developed a 
medium-power, graded-based transistor which produced 
60- V 0.6-A pulses with rise and fall times of 0.1 /xs |16|. 

With this transistor, the Z windings could be driven 
directly. However, this was not possible for the X and Y 
windings because of the opposite-polarity read and write 
currents. The load-sharing matrix switch, invented by 
Constantine of IBM, provided a very attractive solution 
to this problem [17]. It combined the currents of several 
transistors to collectively provide the drive for a single X 
or Y winding. A switch having n outputs requires In drive 
transistors as in a simple transformer drive approach; 
however, the power requirement of each transistor is 
reduced by a ratio of \/n. 


Another improvement was the change from a diagonal 
sense winding to one that is parallel to the X or Y 
windings. This not only reduced sense-winding attenua- 
tion and delay, but it also helped lower the drive power 
requirements. An invention by Crawford of IBM further 
reduced the distortion, attenuation, and delay of sense- 
winding signals by dividing each winding into two halves 
and terminating each with resistors equal to the charac- 
teristic impedance [18]. The improvement in IBM’s sec- 
ond-generation core memory technology can be gauged 
by the fact that the first 7090 memory had eight times the 
capacity for only three times the price of the earlier 704 
memory. It was also six times faster and had approxi- 
mately a tenfold improvement in density and power per 
bit, as revealed in Table 1. 



The 7302. as initially designed, had the array immersed 
in transformer oil to remove heat from the cores. A new 
design, completed in 1962. eliminated this requirement by 
the use of a core with smaller ID-OD, 19-32 mils (0.48- 


0.8 1 mm): see Fig. 1 . The cycle time was 2 /xs. and in 1964 
that was reduced to 1.4 /xs for use with the 7094 II 
computer. Two special memories having 1024 72-bit 
words each and a cycle time of only 0.7 /xs were built in 
1960 for the IBM supercomputer HARVEST. These 
memories used the same cores as the 7302s. but they 
employed a two-dimensional, two-core-per-bit, partial- 
switching storage array [19]. 

All the IBM System/360 processors benefitled from 
advances in core memory technology achieved in the 
mid-1960s. The 19-32-mil (0.48-0.8 1-mm) core developed 
for the air-cooled design of the 7302 memory was used in 
the moderately high-speed memories. An invention by 
Flaherty. Hallstead, Judge. Vaudreuil, Wyckoff, and 
Schug of IBM significantly reduced the problem of wiring 
cores into planes by permitting the same wire to be used 
for inhibit drive and sense output [20]. Thus, only three 
instead of four wires had to be threaded through each 
core. The resulting memories had cycle times of about 
2 /xs and a maximum capacity of 1 179 684 bits per array. 
The same designs were also used for IBM’s first System/ 
370 processors, the Models 155 and 165. 

A new 13-2 1-mil (0.33-0.53-mm) core was developed 
for the highest-speed System/360 memories. These mem- 
ories also used only three wires through each core. A 
cycle time of 0.75 /xs was achieved. Maximum capacity 
memories (32 768 x 72 bits) used Constantine’s load- 
sharing matrix switch for driving the X and Y windings^ 
Smaller-capacity designs used “floating” transformer 
output windings connected to X and Y windings through 
semiconductor diode matrices invented by Sommerfield, 
Norton, and Cooper of IBM [21]! This significantly 
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reduced the number of drive transistors. Cycle times of 
memories using this drive technique were in the 0.9- 1.5- 
/xs range. 

The System/360 memory designed for maximum capac- 
ity and lowest cost per bit was the 2361, announced in 
1964. It used a 2V2-D concept and provided 252 144 72-bit 
words and a cycle time of 8.0 /xs. Low cost was accom- 
plished by the use of very large (256 x 1 152-bit) core 
planes with only two wires threaded through each core. 
The invention by Thome of IBM provided the means for 
doing this by utilizing the same wire for both the drive and 
r he sense functions [22]. 

i Final developments 

A 1 l()-ns-cycle memory was built by Werner. Whalen, 
Lockhart, and Flaker at IBM in 1966 with a capacity of 
8192 72-bit words [23]. It was a 2D design employing 7.5- 
13.3-mil (0. 19-0.34-mm) cores. This memory used the 
two-core-per-bit partial-switching technique previously 
used in the 0.7-/xs HARVEST memories. One important 
difference was the use of a ground plane which was 
placed close to the windings and thereby reduced and 
controlled their impedance. This memory was not placed 
in manufacturing because of the higher performance 
projected for magnetic film memories. 

In the late 1960s another advanced core memory was 
developed for IBM's commercial computers by Councill 
et al. [24]. It had a capacity of 16 384 18-bit words and a 
cycle time of 275 ns. This was a 3D memory using 8- 13.4- 
mil (0.20-0.34-mm) cores with a switching time of about 
50 ns. The array and drive circuits were immersed in an 
:nert lluorochemical liquid to prevent overheating. 

The project also produced a better means of testing and 
wiring cores. A machine invented by Baker, Cadwal- 
lader, and Marinelli of IBM, which grasped the cores on 
the outside and placed them on one of twenty-five probes, 
was implemented in 1968 [25]. It increased the test rate to 
180 cores per second. A scheme developed by Hazel and 
Mueller tested the cores during the wiring operation [26]. 
Unsatisfactory cores were broken and removed, the last 
wire that threaded them was withdrawn, and substitute 
cores were moved into the same positions. This avoided 
the expensive operation of cutting and splicing wires to 
remove faulty cores. 

The 275-ns-cycle ferrite core memory developed for 
IBM System/370 requirements was technically success- 
ful; however, development of manufacturing tooling was 
stopped in January 1968 because of the expectation that 
semiconductor memories and magnetic film memories 
would soon provide superior cost and performance. Nev- 
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ertheless, ferrite cores had provided the most satisfactory 
processor memory technology for IBM over a period of 
nearly two decades— a period during which digital com- 
puter logic made the transition from vacuum tube to 
transistor to integrated circuit technologies. 

Magnetic films 

The first experimental magnetic core memories used 
ceramic bobbins about which thin metallic strips of 
ferromagnetic nickel-iron were wrapped. Eddy currents 
induced in the metal strips caused these cores to switch 
very slowly, so they were soon replaced by faster- 
switching nonconducting ferrite cores. However, Blois 
[27] suggested that nickel-iron films deposited on a sub- 
strate from a vapor could be made thin enough that eddy 
currents would not retard the magnetization switching 
process. He further believed that very fast memories 
could be built with magnetic films because of the simple, 
ultra-fast magnetization reversal that was theoretically 
predicted. j| 

Observations in 1956-57 by Smith and by Olson and 
Pohm [28] showed that the magnetization reversal pro- 
cess in flat films was not so simple. Only by applying a 
field perpendicular to the magnetization direction were 
they able to achieve the fast magnetization reversal 
predicted for a single-domain rotation process. Even 
then, the reversal phenomenon was neither as fast nor as 
simple as predicted by theory. The complex mechanism 
of magnetization reversal in films was finally explained by 
Methfessel, Middelhoek, and Thomas [29] of the IBM 
Zurich Research Laboratory in 1961. Magnetization re- 
versal was shown to consist of a partial rotation process 
in which the more easily rotated regions switch reversibly 
and the large stray fields from them tend to counter- 
reverse adjacent regions, which then cannot switch rota- 
tionally without the application of very much larger 
fields. Their results were consistent with the first high- 
speed sampling-oscilloscope measurements (with an 
overall resolution of 0.35 ns) made by Dietrich and Proeb- 
ster [30] of the same laboratory. Suits and Pugh [3 1] in the 
IBM Poughkeepsie Laboratory subsequently showed that 
a much higher percentage of the film could be caused to 
switch rotationally if the drive line were sandwiched 
between two Permalloy™ films. 

In July 1959 Raffel [32] and his group at Lincoln 
Laboratories placed the first thin-film memory in opera- 
tion with the experimental TX-2 computer. It had a small 
32 x 10-bit array, consisting of 1/16-inch (1.8-mm)-diame- 
ter, 700-A (70-nm)-thick spots of Permalloy™ (trademark 
of Allegheny Ludlum Industries, Pittsburgh, PA) on a 
glass substrate; it was operated with a read-write cycle as 
short as 800 ns. At about the same time, Bradley [33] of 589 
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rated on highly polished 2.5-cm-square coin silver sub- 
strates coated with silicon oxide. Techniques for achiev- 
ing reproducible magnetic films on metallic substrates 
were pioneered in IBM by Bertelsen and Mohr [36], 

In November 1962. a product development program 
was initiated which resulted in the magnetic film memory 
shipped in February 1968 on the IBM System/360 Model 
95. The four magnetic film memory units on each Model 
95 had cycle times of 120 ns and provided 60-ns access to 
9.4 million bits or one million bytes of information. This 
was six times faster than the fastest ferrite core memory 
in the IBM System.360 line of computers. Each memory 
unit consisted of four 1024-word by 288-bit assemblies 
which were driven and sensed in parallel, but only 72 of 
the 288 bits read in parallel were gated out. Each assem- 
bly contained two arrays of 72 3-inch (7.6-cm) bit plates 
arranged back to back in a 6 x 6 configuration. An early 
version of this memory was described by Simkins in 1965 
[37]. 



Figure 2 Photograph of a three-inch square magnetic him hit 
plate with 4 1 "6 hits. These were first shipped on the IBM 
System. -Ml Model ■)? in February IV68. 


international Computers and Tabulators operated a 5(1 < 
50-bit irrav of elements selected from a continuous sheet 
of Gendlin™ i trademark of Hewitt-Robins, Inc.. Stam- 
ford. CT). later revealed to be NiFe with some Co. The 
magnetic film was deposited on aluminum substrates 
which served as ground planes for the drive and sense 
currents. Both of these approaches made use of an 
orthogonal drive scheme to provide nearly rotational 
magnetization reversal. In this scheme a large word-line 
current is used to drive all the films on a given line 
perpendicular to the "1" or "0 " storage states. Positive 
or negative currents in the orthogonal bit lines determine 
whether a "1" or "0" is written as the word pulse is 
terminated. 

Interest in thin-magnetic-film memories increased 
sharply in October 1961 when Rubens [34] of Univac 
described that company's plans to ship a 600-ns-cycle 
thin-magnetic-film memory with 4608 bits on the Univac 
1107. In February 1962, an experimental 18 432-bit mem- 
ory with a 60-ns access and a few-hundred-ns cycle was 
demonstrated by Proebster et it/. [35] in the IBM Zurich 
Laboratory. The bits in this memory were 0.3 mm by 
0.65 mm and 500 A (50 nml thick, and they were evapo- 


The memory array developed by Pugh el al. [38] 
consisted of copper bit plates on which 900 A (90 nm) of 
Permalloy™ was deposited over a smoothing layer of 
dielectric and etched into 4176 rectangular bits 25 x 30 
mil (0.63 :< 0.76 mm) on a side (see Fig. 2). All bits were 
required to be good, but 80 were not wired and were 
reserved for repairs. The rectangular Permalloy™ bits 
were electrically switched and detected by copper con- 
ductors on Mylar “ (registered trademark of E. 1. du Pont 
de Nemours and Co.. Wilmington, DE) foils placed 
directly over the copper bit plates. These conductor foils 
were backed by a flexible ferrite keeper [39] and support- 
ed hy an aluminum frame which was long enough to 
accommodate six bit plates in a row. 

Follow-on thin-film memory designs were developed in 
IBM which offered lower cost and higher performance 
than could he projected either for ferrite cores or for 
plated-wire memories [40], and even more advanced 
concepts using coupled film arrays deposited directly on 
silicon chips with integrated support circuits were consid- 
ered [41]. However, these film memories were never 
manufactured. All development work on film memories 
was terminated in IBM by the end of 1969 due to the rapid 
progress in semiconductor memory technology which is 
described in the next two sections. 

IBM's magnetic film memory was used only on the 
System/360 Model 95 computers. However, it provided. _ 
the highest-performance megabyte memory in existence 
for many years. With over 4000 good bits on a single 
substrate, it had achieved a level of batch fabrication not 
reached by semiconductors until several years later. 
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Bipolar memory 

Bv I960 the semiconductor industry had started develop- 
ment efforts on bipolar integrated circuits. The objective 
was to achieve electrical isolation of the active compo- 
nents of a circuit [42] on a single silicon substrate. This 
powerful concept permitted complete circuits to be fabri- 
cated in a single semiconductor chip. The first work in 
IBM. which applied these integrated circuit techniques to 
the fabrication of memory, was started in 1963. 

At this time, magnetic core memories had been very 
successfully applied to digital computers. They provided 
nonvolatile low-cost storage with cycle times as low as 
fractions of a microsecond. Magnetic film memories had 
similar attributes plus projected cycle times under 100 ns. 
However, the nonvolatility of these memories was 
achieved by the use of passive magnetic storage elements 
that required semiconductor devices to perform the de- 
code. drive, and sense functions. The integration of 
magnetic with semiconductor components involved me- 
chanical assemblies, and the semiconductor device re- 
quirements for the higher-speed memories were generally 
quite severe in terms of power, current, and voltage. 
Furthermore, the mechanical and electrical problems 
became increasingly severe as devices were miniaturized 
and as array capacities were increased to reduce the cost 
per bit. 

Monolithic memories, in contrast, offered the attractive 
possibility of integrating the storage array with the drive, 
decode, and sense circuits— all on one silicon chip. 
Memory cost per bit was more nearly independent of 
memory capacity. However, the pervasive application of 
this technology to main memory technology was regarded 
by many as unlikely. The reasons cited were the relative- 
ly high cost per bit of large-capacity memories, the high 
power required to store information, and the poor reliabil- 
ity of semiconductor storage as compared with nonvola- 
tile magnetic storage. 

These presumed disadvantages stimulated a number of 
developments within IBM. To address the cost problem, 
development efforts concentrated on finding memory 
cells which occupied the least chip area. The power 
problem in memory was greatly reduced by the develop- 
ment of pulse power techniques as first proposed by 
Henle [43]. The potential reliability problem was solved 
by the application of error-correcting codes to the memo- 
ry at the systems level. In addition, the reliability of 
semiconductor storage proved to be much better than was 
originally estimated. The volatility problem proved not to 
he a significant factor in most computer applications. 
Even magnetic memories were rarely designed to provide 
integrity of the stored data if power was lost. 
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• Special-purpose memories 

The first monolithic memory application in IBM was for a 
storage-protect memory on the System/360 Models 91 
and 95. This unit was to contain the code which logically 
prevented access to specified blocks of main memory. 
The ferrite core version of this unit was a 64-word, 20-bit 
memory with a cycle time of 350 ns, whereas the mono- 
lithic version would have a 128-word, 20-bit array and a 
300-ns cycle. The project was initiated in January 1965. 

The memory circuit used in this chip was conceived by 
Farber and Schlig, and an improved version incorporating 
diode loads was first published in May of 1966 [44]. The 
complete cell used five transistors. It was larger than 
other cells which were known at this time but was chosen 
for its high-performance, nondestructive read and its 
large tolerance to component variations. The first. Phase 
1 version was described by Agusta, Bardell, and Cas- 
trucci at the October 1965 Professional Group on Elec- 
tron Devices [45], It consisted of a 4 x 4 array of Farber- 
Schlig cells and contained no decode or drive circuits. 
Two of these chips were mounted on a half-inch ( 1 .27-cm) 
module with six modules on a card. Total memory 
capacity in the application was 1024 9-bit words. The 
monolithic memory protect was announced on the Model 
91 in January 1966, making it the first announced applica- 
tion of monolithic memory technology in a commercial 
system. 



The second application of monolithic semiconductor 
memories was for the automatically buffered cache mem- 
ory system [46] first implemented on System/360 Model 
85. A cache memory is designed to hold that portion of 
the main memory content that is currently being used. 

This is accomplished by a least-recently-used (LRU) 
algorithm implemented in hardware. Most of the proces- 
sor memory references can then be handled by the cache 
with its short access time. For example, in the Model 85, 
a 16K-byte cache of 80-ns cycle and a multimegabyte 
main store with a l-/zs cycle achieved a performance 80% 
as good as would have been achieved by a single main 
memory of 80-ns cycle [47], This resulted in a substantial 
improvement in the cost/performance ratio of the memo- 
ry system and represented a major systems innovation on 
the IBM System/360 Model 85. 

The requirement for a small fast memory for the cache 
application was well suited to the capability of a bipolar 
technology. The Phase 2 memory was developed with 64 
bits on a 2.85-mm-square chip. The memory cell was an 
improved version of the one used in the Phase 1 chip. The 
Phase 2 chip was qualified in September 1967 and later 
described in two companion papers presented at the 
February 1969 International Solid State Circuits Confer- 591 
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Figure 3 \ 128-hit bipolar chip of the type used in the first all- 

semiconductor main memory which was announced on the IBM 
System .'"O Model 145 in September 1970. The core plane 
supporting it has 30-50-mil cores. I he smallest cores u>ed by 
IBM in a product were 13-21 mil. 


cnee iISSCC). The first paper, by Agusta [48]. described 
the memory chip. The second paper, by Ayling. Moore, 
and Tu |49], described the memory circuits and the chip 
organization. Details of the memory cell were published 
in August 1972 by Father and Schlig [50]. 

The Phase 2 chip was the first to contain on-chip driver 
circuits in addition to the array of storage cells. The chip 
had a power dissipation of 112 mW and a differential 
output signal of 2 mA. The X and Y access lines were 
brought to the terminals of the chip with a nondestructive 
read delay of 7 ns and a chip writing time of 12 ns. The 
chip was fabricated with 5-/am minimum image lithogra- 
phy and a single level of aluminum interconnections. Two 
of these chips were mounted on a 23-pin, half-inch (1.27- 
cm) module. A single chip integrated the equivalent of 664 
transistors, diodes, and other components. Phase 2 tech- 
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nology was also used for cache memories in the System/ 
370 Models 165 and 155 with 80-ns and 1 15-ns access 
times, respectively [51]. 

Following the 64-bit chip development, a 32 x 3-bit 
chip was developed. It also used a 5-/im minimum-feature 
technology and was designed in the 1968-69 time period. 
This chip utilized a new cell configuration invented by 
Harper of IBM [52], having two cross-coupled double- 
emitter devices with resistor loads. Two of the emitters 
were joined to a common read line: the other two emitters 
each went to a separate bit line. This was the first chip to 
utilize full on-chip decoding. It was used for storage- 
protect and buffer applications on System/370 Models 158 
and 168. 


The 32 x 3 RAM chip was redesigned in an oxide- 
isolated 2-/xm-epitaxy technology with a modified Harper 
cell and was qualified for production in 1976. The cell 
modifications included the addition of Schottky diodes 
across the load resistor and two read lines from each cell 
which were obtained by means of emitter follower out- 
puts on the collectors of the cell. This modified chip has 
been used in the cache and the channel circuits of the 
IBM 3030 systems. 


At the 1971 ISSCC. Wiedmann and Berger of IBM 
described a memory cell utilizing lateral pnp loads [53]. In 
1973. at the ISSCC. Wiedmann described an improved 
cell with lower power and better performance [54]. The 
cell used cross-coupled npn transistors with lateral pnp 
loads. It was designed into a 512 x 10 array (1-bit 
redundancy) and a number of smaller arrays, all of which 
are used in the IBM 4300 systems. The access times of 
these arrays are in the 30-40-ns region. They are pack- 
aged with logic chips on multilayer ceramic substrates. 

• Main memories 

In January 1968 the decision was made at IBM to stop 
developing ferrite-core commercial memories in order to 
concentrate on monolithic memories. The Phase 21 128- 
bit chip had been developed. This was the first monolithic 
chip with the potential for replacing ferrite memories on a 
cost basis. The objective was to break even in cost and go 
ahead in future extendibility. 


The Phase 21 design utilized the Harper cell and pulse 
power with the peripheral circuits being turned on and off 
[55]. The organization of the chip was 128 x 1. The chip 
(shown in Fig. 3) was qualified in August 1969. Two of 
these chips were mounted on a half-inch (1.27-cm) ceram- 
ic layer with two of these layers being stacked, resulting 
in a 512 x 1-bit module. This module contained 5700 
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components occupying one quarter of a square inch of 
board space. Twenty-four of these modules were mount- 
ed on a 4.5 x 3.5-in. (11.1 x 0.9-cm) card and thirty-six of 
these cards on a 9 x 13-in. (23 x 43-cm) board for a total 
storage capacity of 49 152 nine-bit bytes. This design was 
used as the main memory for the System/370 Models 145 
and 135 [56]. These IBM systems were announced on 
September 23. 1970, and on March 8, 1971, respectively. 
With shipment of the first Model 145 in June 1971, IBM 
had achieved the first commercial application of semicon- 
ductor main memories. 

\ three-quarter-good program was also implemented 
.vim the Phase 21 chip. This permitted 75% utilization of 
the bits on a chip containing a single, or in some cases 
multiple, malfunctioning bits. It was the first effort to 
enhance yields and reduce costs by a partially good chip 
strategy [57]. 

The 128-bit chip technology was described in two 
companion papers which were presented at the 1971 
ISSCC. Gates, McKinney, and North [58] described the 
chip design procedures, and Ayling and Moore [59] 
described the circuits and logical organization of the chip 
and memory card. A 256-bit chip of this design was put 
into production as a cost reduction toward the end of the 
program. The ceramic module was a one-layer design 
holding two chips. 

By 1971, work on an improved bipolar memory tech- 
nology was well underway. The main features were the 
•ise of two levels of metal, a simpler cell configuration 
.insisting of two cross-coupled npn transistors with 
dumbbell resistor loads, a self-aligned emitter process, 
and the addition of Schottky diode gates. In addition, a 
lateral pnp was used for the bit decode circuits [60]. 

The design for a IK RAM was completed about March 
1972 and was applied to a product in June of 1972. The 
chip size was 3.0 x 3.8 mm. The memory units construct- 
ed from this technology ran with approximately 90 ns 
access time at the board level and cycle times of about 
110 ns with pulse-powering capability. At the system 
level, where the technology was combined with error- 
correcting codes (ECC), cycle times were somewhat 
slower. When used as a buffer memory without pulse 
powering, the chip was capable of a 1 3-ns access and a 45- 
ns cycle. The chip was incorporated into a4K x 1 module 
which consisted of two half-inch (1.27-cm) stacked ce- 
ramic substrates with two chips per layer. The chip was 
first used in the System/370 Models 138 and 148 and later 
has been used as a cache technology for Model 168 
through to the IBM 3033. 
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• Future developments 

Two factors limited the further application of bipolar 
technology to main memory. The first was the emergence 
of FET technology with fewer masking steps and a high- 
yield process which combined to produce high-density, 
low-power, and low-cost memory designs. The second 
factor was the acceptance of the cache memory concept. 
As stated previously, the cache memory removed the 
high-performance requirement from the main memory 
technology, permitting the slower FET memories to fulfill 
this role. The continued future application of bipolar 
technology to high-speed register and cache applications 
seems secure for the foreseeable future [61]. The scaling 
principles, while not as simply stated as those for FET 
technology, still apply. Smaller dimensions result in low^ 
er power, improved performance, and improved produc- 
tivity. For the same lithography dimensions, bipolar 
memories will continue to offer higher performance than 
their FET counterparts. The question is whether bipolar 
memories may, at some future time, re-emerge as a main 
memory technology [62]. In either technology, the most 
dense storage cell is likely to remain some form of 
dynamic charge storage. 

MOSFET memory 

The principle of the surface field effect transistor was 
proposed about fifty years ago [63]. Advances which led 
to practical implementation of the metal oxide semicon- 
ductor field effect transistor (MOSFET) took place in the 
late 1950s and early 1960s. Notable among the many 
contributions were the demonstration of a thermally 
grown oxide on silicon as a gate insulator in 1960 by 
Kahng and Attalla [64] and the publication of a compre- 
hensive paper on MOSFET device design and character- 
istics in 1963 by Hofstein and Heiman [65]. 

Distinct from the rest of the industry, which pursued 
the p-channel technology, IBM made an early decision to 
develop the more difficult n-channel technology because 
of its higher performance potential. As a result, IBM 
pioneered the n-channel technology throughout the 1960s 
and began manufacturing high-speed (about 50-ns access) 
1024-bit memory chips in 1971. Memory systems as large 
as 2M bytes (almost 20 million bits) were announced on 
the IBM System/370 Model 158 in August 1972 with 
shipments starting in April 1973. Probably the first com- 
mercially successful p-channel product was the 1024-bit 
chip (the 1 103) which was announced by Intel [66] in 1970 
and shipped in large commercial computer systems in the 
1972-73 time frame. 

The first 64K-bit dynamic RAM engineering parts in 
IBM were fabricated in 1976, manufacture started in 
1977, and 2M-byte memories were shipped in the IBM 
4341 system in 1979. 
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• Early n-channel work 

Early work in several laboratories throughout the indus- 
try indicated that the new MOSFET device would be the 
best choice for achieving many circuits on a single chip 
because of its expected process simplicity and high 
density. A decision was made by the IBM Research 
Division in 1963 to mount a comprehensive LSI program. 
The term LSI was coined at that time in IBM to denote 
Large Scale Integration. To the authors' knowledge this 
was the beginning of the use of that term throughout the 
industry. 

The n-channel technology was chosen by IBM in 1963 
because it had potential for operating at a higher speed 
than the alternative p-channel technology. This choice 
was made in spite of the fact that the n-channel technolo- 
gy was expected to be more difficult to implement. 
Surface scattering and high field effects were expected to 
diminish its potential three-times mobility advantage. The 
field polarity in the gate insulators moved positively 
charged contaminants toward the silicon surface, thus 
reducing device threshold voltages and increasing leakage 
and surface scattering problems. Also, unless an extreme- 
ly high substrate doping was used, the n-channel MOS- 
FET was a depletion-mode or normally-on device. This 
limited its circuit usefulness and created a severe surface 
leakage problem. 

Several key discoveries in 1963 and 1964 made the n- 
channel technology viable. Cheroff, Fang, and Hochberg 
[67, 68] and Lehman [69] of IBM showed that a proper 
low-temperature annealing process would stabilize the n- 
channel device mobility at a high value by eliminating fast 
surface states [67], ( Lehman's work [69] also showed that 
silicon could be used as the gate electrode material.) The 
n-channel devices had been essentially nonoperational to 
this point. Hydrogen was shown to be an important factor 
in this annealing process [70], A second key concept, of 
Cheroff, Fang, and Seraphim [71] and of Heiman of RCA 
[72], was the use of a substrate bias to control device 
threshold voltages and thereby turn off both the n-channel 
switching devices and the parasitic thick oxide devices. A 
third element was the work of Kerr, Logan, Burkhardt, 
and Pliskin of IBM [73], which showed that phospho- 
silicate glass (PSG) over an Si0 2 film resulted in an 
electrically stable structure with large latitudes in cleanli- 
ness and process control. 

From 1964 to 1968 considerable effort went into under- 
standing device physics and establishing a reliable and 
reproducible process. During this period, there were 
many other laboratories in the industry which were also 
very active in basic MOSFET work. For example, Snow, 
Grove, Deal, and Sah of Fairchild Semiconductor Labo- 



ratories showed that Na~ was the major contributor to 
threshold instability [74]. Later they showed that polar- 
ization of the PSG could introduce a threshold shift [75], 

A critical accomplishment in IBM was the work of Balk 
and Eldridge in the mid-1960s which showed how to 
optimize the thickness and concentrations of the PSG 
layer to achieve stabilization under practical fabrication 
conditions without significant polarization effects [76], 
Also, Chou and Eldridge [77] showed that the PSG layer 
improved gate reliability since it forms a barrier between 
the aluminum gate electrode and the thin oxide. This, 
combined with improvements in the purity and structural 
perfection of the SiO, insulator resulting from the imple- 
mentation of a super-clean dry oxidation process in 1963, 
resulted in dramatic improvements in yield, breakdown 
voltage, and reliability. 

A final ingredient of an optimized n-channel device was 
the substrate crystal orientation. Work reported in 1965 
by DeLord et al. of Reed College based on experimental 
work at Tektronix [78] and by Balk, Gregor, and Burk- 
hardt from IBM [79] showed that (100 ) surfaces exhibited 
a lower charge. Then Fang and Fowler demonstrated that 
the (100) surface also had a higher mobility [80]. This 
firmly fixed the choice of the (100) orientation for n- 
channel devices. 

The early IBM work was largely devoted to LSI logic. 

By mid-1966, the basic n-channel device, circuit, and chip ' 
designs were established [81]. An automatic artwork ; 
generator, which set the direction for the industry, was 
developed in 1965-66 and was used to make the 10X 
masks for the experimental MOSFET chips [82]. 

The use of a substrate bias for the n-channel device 
proved to be a critical element. Not only did the bias turn 
off the switching device and reduce leakage, but it provid- • j 
ed some very important design flexibilities [83]. It al- 
lowed a dramatic reduction in the field oxide thickness 
needed to maintain proper isolation between devices. 
Also, it allowed the use of lower substrate doping which 
reduced substrate sensitivity effects and junction capaci- 
tance. 

In the mid-1960s, it became apparent that semiconduc- 
tor memory had the potential of replacing ferrite core j 
systems for main memory applications. Pleshko and 
Terman of IBM showed that high-speed memories could j 
be implemented by using six-transistor MOSFET static 
cells in the memory array and by using bipolar circuits to 
provide the high-speed decoding, driving, and sense- ■ 
amplifier functions on separate chips [84]. A series of,; 
memory test chips were fabricated , including a 48-bit chip 
in 1966, a 128-bit chip in 1967, and a cross section of a 
512-bit chip in early 1968. 
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These advances were paralleled in the mid-to-late 1960s 
by the development in IBM of dynamic memory cells 
which promised further density improvements: the four- 
transistor cell [85], a three-transistor cell [86], and the 
one-transistor cell [87], The one-transistor cell, conceived 
by R. Dennard, uses a MOSFET device as a switch and a 
capacitor as the storage element. It has become the 
standard high-density cell in the industry. 

In April 1967 a one-year pre-product memory develop- 
ment program was set up between IBM Research and the 
product laboratories. By 1968 a process [88] suitable for 
making complex development parts had been practiced. 
Considerable data had been gathered on reproducibility, 
gate shorts, and reliability under stress; and an experi- 
mental 16K-bit hardware model was assembled. This 
model was run successfully under test for two years. 

A final requirement was the use of the IBM flip-chip 
attachment techniques. The residual charge and radiation 
damage from the required sputtered quartz overcoat were 
prime concerns. The enhanced design margin provided by 
the substrate bias and the PSG passivation, combined 
with clean sputtering techniques [89], resulted in a manu- 
facturable process. A 512-bit product prototype was 
designed and built in late 1968. A key innovation by G. 
Sonoda [90] was the bootstrap circuit which allows a gate 
to be boosted to a voltage higher than the power supply in 
order to overcome device threshold effects. 

The very tight coupling between the device, circuit, 
and technology efforts from 1963 through 1968 made it 
possible for a comprehensive device and chip design 
manual to be written which is still widely used in IBM as a 
basic MOSFET reference. The highlights of the manual 
were published externally by Critchlow, Dennard, and 
Schuster in 1973 [91]. The device design was centered on 
a 50-nm gate oxide thickness with a 4-pm channel length. 
The gate oxide thickness was later increased to 70 nm for 
the actual products. 

• MOSFET memory products 

Preproduction parts using 512-bit chips were successfully 
made in 1969 and 1970 and delivered to system builders. 
The process and design were qualified to meet reliability 
and yield requirements and a 1024-bit chip was put into 
production in 1971. This product was soon upgraded to a 
2048-bit chip [92]. 

Four 2048-bit chips with a nominal access time of 50 ns 
were packaged on an 8K-bit multichip half-inch ceramic 
module. The memory and support modules were pack- 
aged on an 11.5 x 8.5-cm card to provide 32K x 8-bit 
(256K-bit) memory cards which were plugged into boards 
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Figure 4 The double-layer, one-inch ceramic module holds 
eight 64K-bit SAMOS chips for a total of 512K bits. IBM 
announced this memory technology in October 1979. 


to provide a 2M-byte memory system. Error-correction 
coding techniques were used at the memory system level 
to meet reliability, availability, and serviceability require- 
ments. A partially good chip approach [57] was used to 
increase productivity in early manufacturing by utilizing 
the portion of the chip which met specifications. This 
technology, with a number of process and design en- 
hancements such as the substrate voltage generator [93], 
was the mainstay of the MOSFET memory and logic 
products in IBM throughout the 1970s. 

The main thrust of the second generation of IBM RAM 
products was exploitation of the high-density potential of 
the one-transistor cell to achieve a low cost per bit. After 
extensive study, the decision was made in 1972 to devel- 
op a unique metal gate MOSFET technology called 
SAMOS (silicon and aluminum metal oxide semiconduc- 
tor). The process, invented by Garnache and Smith of 
IBM [94], uses a 45-nm-equivalent nitride-oxide gate 
insulator for enhanced gate reliability and a polysilicon 
field shield to reduce surface leakage to very low levels. 

A series of product designs [95 , 96] were developed using 
line widths as narrow as 2.5 pm. 

The first 64K-bit chips were made in 1975, engineering 
parts were shipped to systems developers in 1976, manu- 
facturing was started in 1977, and systems containing 
these chips were shipped in 1979. Thus, the decision to 

implement the unique SAMOS process in 1972 provided :;L . 

IBM with a 64K-bit-chip memory product well before its 
competition. A very high packaging density is achieved 

by placing eight 64K-bit chips on a double-layer, one-inch 595 
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ceramic module allowing a lM-byte memory to be pack- 
aged on an 11.3 by 17.6-cm card (see Fig. 4). 

This program encountered four major technical chal- 
lenges. First, the very severe leakage requirements led to 
the use of the field shield and the development of im- 
proved gettering techniques. Second, the small geome- 
tries resulted in high fields with the possibility of hot- 
electron injection into the gate insulator. This led to 


There continue to be device innovations. The Merged 
Charge Memory (MCM) structure [106], which allows 
Charge-Coupled Device (CCD) concepts to be used to 
fabricate dense RAM, is interesting in that it may provide 
a cost /performance tradeoff between RAM and CCD» 
Improved device structures such as the Lightly Doped 
Drain (LDD) MOSFET [107] are extending the limits of 
the technology and promise further speed and density 
improvements. Advances in CMOS technology indicate . 


extensive research on hot-electron effects [97] which has 
defined one of the limits of MOSFET technologies. Third, 
the small signal charges of only 200 to 300 fC for the 64K- 
bit chip required sense-amplifier innovations [98]. The 
final challenge, of attaining acceptable yields, was allevi- 
ated by the use of redundancy techniques in which extra 
bit and word lines are provided to replace defective lines 
or cells [99]. Fusible links are fabricated on the second 
level of metallization to effect the wiring changes. Two 
further novel features of the SAMOS process are the use 
of a lift-off metallization technique [96, 100] and the use 
of a two-level insulator consisting of sputtered quartz 
covered by polyimide [96]. 

• Future directions 

A major trend in MOSFET technology has been to scale 
to smaller dimensions and voltages in order to achieve 
enhancements in density, performance, and power [101]. 
IBM has been very active in the development of ad- 
vanced lithography techniques including the development 
and use of electron-beam direct write exposure systems 
[102]. Designs and processes have been developed in the 
laboratory and practiced on complex chips to demon- 
strate their viability and to quantify practical design 
limits. For example, an experimental 8K-bit silicon gate 
memory chip was fabricated with 1.25-/im minimum 
dimensions in 1975 [ 103 J. This technology was then 
developed to a much higher degree and used to fabricate 
complex logic chips in the 1-^tm Logic and Array Test 
Vehicle (LATV) project from 1975 to 1978 [104]. 

As circuit miniaturization is pushed to the limit, prob- 
lems of high current densities, hot electrons, and ionizing 
radiation are becoming more significant. Basic work and 
innovation continue. For example, as current densities, 
resistive voltages, and time constants became limiting 
factors, IBM engineers reported that a second level of 
metal provides a significant performance advantage for 
high-speed designs [105]. IBM has pioneered the use of 
the metal lift-off technique [100], which allows line widths 
to be greatly reduced while not reducing the line thick- 
ness in proportion. Significant contributions have also 
been made to the use of low-resistivity refractory metal/ 
polysilicon conductor technology to overcome the speed 
596 limitations of circuits using polysilicon conductors [104]. 


E. W. PUGH ET AL. 


that it will become more significant in the future. Clearly 
we will see further dramatic enhancements of MOSFET 
memory technology during the 1980s. 

Magnetic bubbles 

Bobeck of Bell Laboratories first proposed the use of 
magnetic bubbles for information storage in orthoferrites 
[108] in 1967 and then in garnet films [109] in 1970. 
Researchers at IBM quickly became interested because 
there was reason to believe this technology might offer 
the nonvolatility of ferrite cores and magnetic films with a 
cost even lower than projected for semiconductor memo- 


Significant contributions were made to the develop- 
ment of garnet compositions and epitaxial growth tech- 
niques by Giess et al. [110] in 1971. Amorphous Gd-Co 
films were developed by Chaudhari et al. [1 1 1] and shown 
to support very small magnetic bubbles. The first report- 
ed observation of velocity saturation of magnetic bubbles 
was by Calhoun, Giess, and Rosier [112]. The theory for 
this phenomenon was developed by Slonczewski [113], 
and its significance to bubble devices was more fully 
developed by Argyle, Slonczewski, and Mayadas [1 14]. A 
major contribution to the modeling of bubble devices was 
the closed-form equations of Almasi and Lin [1 15], which 
were based on a magnetic circuit model for bubble- 
Permalloy™ interactions. 

Almasi, Chang, Keefe, and Thompson [116] were the 
first to propose magnetoresistive sensing of magnetic 
bubbles, which is now used in all magnetic bubble de- 
signs. Almasi et al. [117] were also the first to report the 
operation of a fully functional bubble chip. This chip, 
with 12-/xm-diameter bubbles, used magnetoresistive 
sensing as well as unique on-chip decoding. In 1973 an 
IBM interdivisional program produced a packaged 1000- 
bit-chip device which achieved nonvolatile operation at 
100 kHz. It incorporated the first in-package preamplifier 
for noise reduction as well as magnetoresistive sensing 
[1 18]. It also made use of propagation elements deposited 
directly on the garnet as proposed by Shick et al. [119] 
and the major-minor loop organization proposed by 
Bonyhard et al. [120] of Bell Laboratories. Subsequent 
IBM work on “conventional” bubble devices was aimed 
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primarily at the use of e-beam [121] and x-ray [122] 
lithography for device miniaturization. 

» Contiguous-disk devices 

The observation of bubble propagation around masked 
contiguous disks in ion-implanted garnets reported by 
Wolfe et al. [123] of Bell Laboratories in 1972 generated 
substantial interest in IBM, where an internal study of the 
potential and limits of bubble technology had already 
started a search for gapless structures in order to achieve 
higher densities and lower costs. A major breakthrough 
was the identification of a magnetically charged domain 
wall as the propagation mechanism in these new ion- 
implanted devices [124]. Making use of this mechanism, 
Lin. Almasi, and Keefe [125] achieved the first successful 
demonstration of all-contiguous-disk device functions in 
1974. A written account of this 5-p.m bubble device first 
appeared in a review article in 1977. Propagation paths 
were defined on the garnet by a gold contiguous-disk 
pattern which was used to mask out ions when the rest of 
the garnet surface was implanted by high-energy ions. 
After ion implantation, this gold mask was removed. 
Then insulating layers, Permalloy™ patterns, and gold 
lines were deposited on the garnet to form sensors, 
nucleators, switches, stretchers, and the connecting con- 
trol lines. 

By 1975, there was also strong motivation to demon- 
strate much higher densities. A double-layer gamet struc- 
ture was developed [126] in which the EuTmYGaIG 
composition of the bottom layer (or storage layer) near 
the GGG substrate was selected for bubble stability and 
the GdTmYGaIG composition of the second layer (or 
drive layer) was selected to facilitate magnetic anisotropy 
sign reversal by ion implantation through a mask to create 
the desired propagation structures. These double-layer 
structures were used in the first demonstration of 1-pm 
bubble propagation and manipulation in contiguous-disk 
devices [127]. 

Ferrofluid studies by Lin, Dove, Schwarzl, and Shir 
[128] provided a detailed understanding of magnetic 
charged- wall mechanisms in the tridirectional anisotropic 
(111 ) garnets. The words "whip” and "flip” became part 
of the magnetic bubble lexicon in describing the behavior 
of charged walls. These studies facilitated a major design 
breakthrough in which switches used bubble striping 
between the major and minor loops [129] and crystallo- 
graphically phased current pulses to select the direction 
of transfer [130]. It also permitted the effective utilization 
of a novel self-aligned structure [131] in which the ion- 
implantation and the control-line masks were combined 
into one. 



Figure 5 Photograph of a contiguous-disk bubble storage re- 
gion with a 5 -fim propagation period and a density of 3.3 X 10® 
bits/cm’. A control current through the horizontal track is used 
to switch bubbles from the input track to the storage tracks in 
this self-aligned structure, described in Ref. 130. 


In 1979 Lin, Almasi, Keefe, and Pugh [130] reported 
the full-chip operation of contiguous-disk devices at 
150 kHz with better margins than any previous contigu- 
ous-disk devices. These devices utilized an easy-to-fabri- 
cate, self-aligned structure, l-pim bubbles, and a 5-pm 
period to achieve the highest-density magnetic bubble 
devices to that time (see Fig. 5). 

• Bubble lattice files j 

One of the most innovative ideas to emerge from IBM’s 
bubble work was the/proposal made in 1971 by Calhoun, 
Slonczewski, and Voegeli [132] to use regular, close- 
packed arrays of magnetic bubbles to store information. 
Such arrays of bubbles were observed during the 1960s 
[133], and their general properties such as bubble spacing, 
diameter, and energy had been described by other work- 
ers [134]. However, the idea that bubbles could be coded 
by means of their domain wall states and then manipulat- 
ed in an array to provide high-density storage had not 
previously been proposed. 
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Demonstration of bubble lattice propagation with mag- 
netic bubbles in a garnet was first achieved in January 
1974 and reported in 1975 [135]. This device also demon- 
strated the use of parallel magnetic stripe domains as 
buffer regions [136] to either side of the bubble array. The 
first published description of a complete bubble lattice 
storage device was provided by Voegeli, Calhoun, Rosi- 
er, and Slonczewski [137], It included the important 
concept of access channels proposed by Rosier [138] in 
which access channels were provided at selected posi- 
tions within the lattice area. The lattice was translated 
until a desired column of bubbles was in one of the 
channels. The column of bubbles was then propagated 
out of the array while new bubbles were inserted from the 
other end of the channel. 

Several methods for distinguishing among different 
domain wall states were proposed by IBM workers. The 
method finally implemented was to utilize the different 
deflection angles observed among bubbles with different 
wall states as proposed by Slonczewski [139], The first 
experimental observation that bubbles with a small num- 
ber of Bloch line pairs moved at fixed angles to a field 
gradient was reported in 1973 by Slonczewski, Maloze- 
moff, and Voegeli [140], The deflection angles were found 
to be proportional to the 5 number or “state" of the 
bubble, defined as the number of rotations of the magneti- 
zation in the bubble wall during a complete circuit of the 
circumference of the bubble. 

Experimental work by Hsu [141] showed that two 
bubble states can coexist in bubble films in which the 
magnetization is exchange-coupled to an in-plane layer. 
When the S = I -state bubble is propagated in a magnetic 
field gradient, it moves at an angle of 5° to 30° from the 
field gradient, depending on the material. The 5 = 0 
bubbles propagate parallel to the field gradient. Hsu also 
showed that appropriate application of in-plane fields 
could be used to switch the bubble from 5 = 0 to 5 = 1 or 
vice versa. This work verified the operability of Slonc- 
zewski's coding scheme and provided the basis for subse- 
quent studies of wall states in bubble lattice devices. 

Beaulieu et al. [142] identified eleven transition mecha- 
nisms between seven different wall structures of which 
four structures corresponded to 5 = 1, two to 5 = 1/2, 
and one to 5 = 0. The discovery by Brown and Hsu [143] 
of better state stability in SmCaGe than in the originally 
used YEu gallium-iron garnets was of considerable im- 
portance to device work. 

All of these concepts were tested successfully in a fully 
functional bubble lattice storage device of 1000 bits in 
1978 [144], Although the demonstration vehicle was 


judged to be too complex for commercialization, it had ? 
proved that all of the concepts could be made to work 
simultaneously on the same garnet chip. 

Potentially simpler devices had already been proposed : 
by several IBM workers in which bubble array propaga- 
tion was affected by an external rotating field and Permal- 
loy™ structures similar to those used in conventional ■■ 
bubble devices [145], The most successful test of these 
concepts was obtained with a 14 000-bit array by Brown 
el at. [146]. These results showed that the major-minor 
loop architecture and symmetric Permalloy™ chevron 
propagation structures of conventional bubble devices 
could be combined with wall-state coding to achieve very 
high storage densities. 

• Future directions 

Several companies have begun to ship "conventional” j 
magnetic bubble products with up to one million bits per 
chip. This is 16 times more bits per chip than is presently ! 
available in any semiconductor memory product. Never- " i 
theless, the projected costs per bit are not yet sufficiently.,-^ 
lower than those projected for higher-performance semi- • 
conductor memories to ensure adequate sales for magnet- 
ic bubble devices. Further advances in conventional : 
bubble technology or a shift to more exotic designs 
appear to be needed. Contiguous-disk and bubble lattice '£! 
file devices are still in the research stage; however, work 1 
on contiguous-disk devices has been initiated in a number I 
of laboratories, suggesting that an increased rate of 
progress can be expected. 

In any event, semiconductors are expected to maintain 
a dominant role in memory technology for the foreseeable : 
future. They provide higher performance and, in very ; 
small memories, a lower cost than any magnetic bubble £ 
device technology proposed thus far. 
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Semiconductor Logic Technology in IBM 


In the last twenty-five years the performance of integrated circuits has improved by more than three orders of magnitude 
and the unit cost has been reduced by approximately the same factor. This paper describes the evolution of 
semiconductor logic technology in IBM from its early replacement of vacuum tubes in the mid-1950s to the beginnings of 
VLSI. It highlights the major challenges and accomplishments in the development of bipolar and field-effect transistor 
technologies and their embodiment in components for a wide spectrum of IBM products. 


Introduction 

Ultimately, the purpose of all the creativity and toil 
applied to the development and manufacture of the elec- 
tronics for data processing systems can be reduced to a 
few key objectives: 

• to improve the performance— reduce signal handling 
and/or propagation time; 

• to lower the cost— including packaging, power, and 
cooling costs; 

• to enhance the reliability— decrease the failure rate and 
increase length of life; 

• and to shorten the development/manufacturing cycle- 
marketing considerations often demand minimum lapse 
of time between product definition and its availability. 

The quest for improved computer performance and 
reliability led to the replacement of relays by vacuum 
tubes and solid state germanium diodes in the early 1950s. 
The purpose of the vacuum tube was to provide signal 
amplification, to perform the logical invert function, and 
to drive the long cables between computer I/O (input/ 
output) equipment and the main frame, and or OR logic 
was performed with point-contact germanium diodes. 
Transition to all solid state circuits was completed by the 
mid-1950s, driven by the rapid development of the alloy 
junction bipolar transistor. 

Even though the early transistors were hardly cost- 
competitive with vacuum tubes, they had several intrinsic 


advantages over vacuum tubes in digital circuit applica- 
tions. Solid state devices have a much lower failure rate 
and a significantly longer life. The operating voltages are 
lower, and the absence of high-temperature cathodes 
results in lower power dissipation and in appreciably 
lower operating temperatures. Denser packaging is hence 
possible with the same or even reduced cooling require- 
ments. Denser packaging permits shorter interconnec- 
tions which reduce capacitive loading and/or signal prop- 
agation delays, thus contributing to improved perform- 
ance. 

Since the first use of transistors in the mid-1950s, IBM 
scientists and engineers have originated many revolution- 
ary changes in structure, design techniques, packaging, 
manufacturing, and testing of solid state devices and 
integrated circuits; and they have invented several impor- 
tant device and circuit configurations. The performance 
(delay) has been improved by more than a thousand 
times, and the unit cost has been reduced by about the 
same factor. 

During the late (950s and early 1960s, when the per- 
formance limits of transistor logic circuits were still being 
explored, two other competing approaches were studied: 
tunnel (Esaki) diodes and parametric amplifiers. For 
amplification, both relied on a negative resistance phe- 
nomenon, inherent to the tunnel diode and induced in the 
parametric amplifier by a “pump” of double frequency. 
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accepts files sent to it by other users of the VM/370 
system or by another computer. Using a unique identifier 
for each computer system, RSCS then selects a communi- 
cation link and sends the file to its counterpart on the next 
system, which repeats this process until the file reaches 
the destination. The two-tier address structure, system 
and user, used by RSCS currently addresses within IBM 
an estimated 50 000 users of more than 400 systems 
spanning five continents. 

Summary 

VM/370 has evolved into an operating system which, 
judging by its use and popularity, has some merit. How- 
ever, it is only a small step toward the goal of making the 
use of computers easy and convenient. Although, as with 
computing in general, it is impossible to step back the 
clock even a few years without realizing the progress 
which has been made, many of us do not appreciate how 
far we have come because we can see how far there is yet 
to go. Incorporation of high-level function within the 
hardware, easy to use hardware and software for less 
specialized user groups, and evolutionary growth comple- 
mentary to the future's distributed computer and informa- 
tion utility all seem possible within the original CP/CMS 
design, but remain to be done. 

The Control Program provides an environment for the 
operation of established systems and for the evolution of 
new systems and hardware within the multiple machine 
architecture. The ability to run several operating systems 
at the same time, both for function and convenience, 
appears to be a key element in the acceptance of VM/370. 
Each operating system, like CMS, RSCS, or OS/VS, 
provides important capabilities for an installation. Test 
and production systems can coexist to smooth the transi- 
tion to a new release. Specialized and experimental 
systems can be developed conveniently without the dis- 
ruption of normal work. 
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The history of memory management technology in IBM during the period between the 1950s and the early 70s is discussed 
in this paper. The paper concentrates on the programming and operating system aspects of the problem, rather than the 

hardware technology involved. 
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Introduction 

The speed at which digital computers operate depends 
heavily upon how fast the processors can get data and 
instructions from memory. Over the past quarter century, 
the internal organization of computers has become in- 
creasingly sophisticated as a result of efforts to make data 
more readily accessible to the central processing unit. In 
parallel with this engineering activity, systems program- 
mers, installation managers, and computer scientists have 
been striving toward the same goal. Their effort, to 
optimize the processor's access to information, has come 
to be called memory management. In this paper we 
review the evolution of memory management in IBM, 
particularly as it concerns and affects programming. 

In order to present some ways of managing memory, 
we first need a model on which to define basic objects and 
concepts. Data and programs are stored, usually in binary 
form, in a memory subsystem. On early computers, the 
memory subsystem was a single main memory. Comput- 
ers became faster and computer problems larger, but a 
single main memory that was both fast enough and large 
enough had not really been available. This led to a 
memory subsystem organization consisting of a set of 
devices, typically consisting of a small fast main memory 
for the immediate needs of the processor and some larger, 
slower devices holding data not expected to be required 
soon. These devices are usually arranged in a hierarchy 
and are interconnected so that data can be moved about 
independent of the processing of other data. 
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Thus our simple model, or abstraction, consists of a 
processor and a memory subsystem, with information 
flowing between them. The processor works cyclically, 
and at the completion of almost every cycle, a specified 
piece of information is sent to or requested of the memory 
subsystem. The memory subsystem then accomplishes 
the task with some delay. The following questions imme- 
diately arise: (1) How is the information piece specified? 
(2) How large are the pieces? (3) How rapid is the 
response of the memory subsystem? 

Question (1) is that of addressing, which can be per- 
formed in either of two fundamental ways: by content or 
by location. In the first, the requested information is 
found by (partial) matching, as in the process of finding a 
telephone number in a list in which each number is next to 
a subscriber’s name. (Notice that the name-number pairs 
do not have to-be stored in any order.) An example of the 
second way of addressing is the looking up of an article in 
a book by using the table of contents, which gives, in 
numerical order, the starting P a 8 e number of each article. 
Due to its predominance, in our paper the latter, location- 
based or coordinate addressing is assumed, and the 
extent of addressability is called an address space. 

I 

Question (2) is not an issue since we introduce a 
simplifying assumption: An address always refers to a 
portion of memory whose size is constant in a given 
context in this paper. 
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Question (3) relates computer performance to memory 
management. The time required for the memory subsys- 
tem to deliver (or store) a piece of information depends on 
the arrangement of the devices in the subsystem, as well 
as the arrangement of the programs and data on those 
devices. The result is that the rate at which the processor 
operates is not uniform; periods of full speed execution 
are interspersed with periods of waiting. 

• Memory mnnaycownl schemes 

One way to reduce if not eliminate idle processor periods 
is for the programmer to anticipate its demand for new 
information. Thus, new data are brought into main memo- 
ry concurrently with, but independently of. processing. A 
second way. called multiprogramming, is to keep several 
independent programs in memory concurrently. When 
one program encounters a delay in the memory subsys- 
tem. the processor can be switched to another program 
which was previously delayed but is now ready to run. 
We defer till later discussion of multiprogramming. For 
both schemes, however, the idea is to overlap the time 
required to move data in the subsystem with some other 
processing activity. 

Until about fifteen years ago. computers only ran one 
program at a time, and the overlapping of computation 
with inpudoutput activities was considered a minor task, 
to be easily included in the program by the programmer. 
At that time, both the programmer and the machine 
designer assumed that the computational aspects of com- 
puting were predominant, with the programmer concen- 
trating on algorithms and their representations, not on 
data, and the designer on fast processors, not on memory 
subsystems. It soon became apparent, however, that the 
problems of complex program and data structures, often 
far larger than main memory, were anything but a minor 
task for the programmer. 

In response, system programs handling memory man- 
agement began to appear. Many of these solutions were 
also supported by innovation in hardware design. Slowly, 
a basic dilemma became crystallized: how to make a 
program containing only functions relevant to the prob- 
lem it is intended to solve, yet assuring efficient use of 
machine resources. 

One way that this problem was dealt with was to 
identify the separate memory management functions and 
to provide system programs to perform them. These must 
allow the programmer to work only with the logical 
structure of data, rather than the way the information 
pieces are arranged in main memory. The decomposition 
of memory management results in subtasks such as the 
following: 


• Arrange data and program segments within a single - 
linear address space. 

• Manage the capacity limitation of this address space. 

• Manage the trade-offs introduced by the cost of differ- 

ent devices in the memory hierarchy. ^ 

• Manage the capacity variations introduced by the idio- 
syncrasies of hardware: nonstandard sizes of informa- 
tion pieces (words, records, etc.) and changes in systemgg 

configurations. 

As we see, the situation has become far more compli- tr- 
eated than it appeared in our simple model. Not surpris- 
ingly. a quite sizable and growing community of research- c, 
ers and system designers has been involved in solving the 
problems. IBM has been one of the leaders in its effort to ■. 
combine the many component solutions in a systematic^-; 
way and to integrate them into practical systems. Howev- 
er, few of these component solutions have been reported^- 
in the literature, and their origins are almost impossible to .- 
trace, a situation which is reflected in this paper. Jg 

In the next section we discuss how memory manage- £ 
ment is handled beginning with program preparation and 5- 
continuing through program execution. Greatest empha- ^ 
sis is on execution-time memory management, that typi-p 
cally done by operating systems. This topic is further ' A 
elaborated in the following section, which is. in turn, ^ 
followed by sections on the approaches to memory man- 
agement taken in several particular IBM operating sys-* 


The four phases of memory management 

The solutions to the problems of memory management, | 
are distributed (with some overlap) over several phases: >| 
structuring the program (program design), translating the | 
program from a programming language to machine code 
(compilation), presenting the program material to the . 
machine (link-editing), and executing the program (dy - ^ 
namic memory management). 

• Program design 

In the program design stage decisions must be madel 
about the organization of the program and its data mto| 
subroutines, control tables, work areas, and buffers. Key ; 
elements of this activity include planning of storageJ 
occupancy, storage requirements during execution, link- 5 
age among program elements, and availability of data at 
the proper times. However, programs and their data 
cannot always be arranged linearly with respect to each . 
other. Also, as execution progresses, programs and data 
that are no longer required in memory can be removed 
and the space occupied by them made available for, 
occupation by others. 
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The collection of programs that control the execution 
of the processor and to a large degree effect memory 
management became a part of what was first called a 
'•monitor" and later an "operating system.” The func- 
tions of an operating system can usually be separated into 
two parts: (1) those providing services to individual user 
programs; (2) those applying some particular policy {e.n . , 
job priority) to each program with respect to its use of 
shared or limited resources. Since a principal objective of 
an operating system — keeping the processor busy running 
programs— rests on having the programs and their data in 
memory, memory management is central to its design. 
However, since there are memory management aspects 
to each of the three other phases of program preparation, 
and since these three phases span the time between 
creating and running the program, the following questions 
arose: Who is responsible for memory management? 
How can memory management requirements be ex- 
pressed so they are consistently stated at each phase? 
Can the whole problem be handled automatically? By the 
end of the 1950s. this last question had become the key 
issue in design of operating systems. 

By 1961 [2] this issue came to be characterized as "pre- 
planning versus dynamic storage allocation.” Pre-plan- 
ning aimed at maximum run-time efficiency; it entailed 
greater human effort and required more sophisticated 
programmer tools. Dynamic storage allocation aimed at 
increasing programmer productivity even at the possible 
expense of run-time efficiency. Both required innovative 
techniques for execution-time memory management, for 
with either the system designer had to confront the limited 
size of memory and the contiguity of its addresses. 

In June 1961, at the ACM Dynamic Storage Allocation 
Symposium [3], the developers of the ATLAS computer 
from Manchester, England, described a hardware tech- 
nique termed the one-level storage system. This hardware 
allowed the address of a data item to be treated, not as its 
coordinate (i.e. , location), but as its identifier. Only when 
the processor requested the data at an "address” was it 
necessary to determine the memory coordinates of that 
data. Termed "dynamic address translation," this con- 
ceptually simple extension of computer addressing gave 
the operating system designer a means to attack all three 
parts of the memory management problem: Program 
addresses could be larger than real memory addresses; 
they could remain consecutive, even though the associat- 
ed main memory addresses were not; and allocation of 
programs to main memory could be changed dynamical- 
ly— even during execution of the program. To some 
extent offsetting these advantages was the fact that the 
translation hardware added complexity (and cost) to the 
processor and, in general, caused it to run slower. 
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Until 1972, IBM supported two relatively independent^ 
approaches to operating system organization. The main 
line of support managed program execution in the rerf'’’ 
memory of the system. DOS/360 and OS/360 were the two 
IBM operating system families in this line. Examples are 
drawn from OS/360 to demonstrate the progress made in 
memory management. Designed for good run time speed 
of production (r.e., polished, frequently run) programs! ' 
OS/360 provided both the programmer and the system 
operator with a large selection of memory management'' 
capabilities that permitted maximizing both program and* 
system efficiency. The other approach to operating sys?’ 
tern organization exploited dynamic address translation 3 ' ' 
and aimed at providing an execution-time environment? 
for programs with highly dynamic memory requirement^^ 
Often characterized as "time-sharing" systems, these 
latter systems allowed the determination of memory! 
requirements to be deferred until a user entered a com- 

In the following sections we describe first a few repre-fi 
sentative stages in the development of the "real member 
ry" based operating systems and then some develop-??; 
ments in virtual storage based systems. We finally consid-? 
er the merging of these two independent lines into the?, 
most recently available IBM product— MVS. 

Conventional memory management— Stretch 

An early, rudimentary attempt at the management of real"' : 
storage among independent users is found in project/!; 
STEM (STretch Experiment in Multiprogramming) in the % 
late 1950s [4], This was a prototype batch-job multipnw|| 
gramming system for the IBM 7030 computer. Its design 
emphasized strategies for scheduling {i.e., selecting jobSjju 
to be run. placing them in memory, and getting them%? 

started ) and dispatching {i.e., selecting which job ill:; 
memory is to be assigned next to the processor) that A 
provide for the efficient use of the central processor. The^E 
goal was to be achieved by attaining the maximum** 
multiprogramming level. The memory requirement of a,^ 
job was a key parameter in its scheduling, and each job < ff 
included a declaration of its maximum memory require-,, 
ment, which was allocated to it by the system. Whatever® 
memory management functions were required within this 
allocated space had to be incorporated into the appliraVwS 
tion program itself. 

A second key input parameter to the job scheduler was^® 
the expected running time of the job; jobs were scheduled 
based on this estimate. The job expected to run the 
longest was placed at one extreme of memory; the nex 
longest running job was placed at the other extreme 
Scheduling proceeded in this manner, with short runnl 
jobs in the middle and long running jobs at the extrem 
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The dispatching strategy was to attempt to complete the 
center, shorter running jobs first. Thus, on two succes- 
sive job completions the two blocks of storage that were 
relinquished would yield a single block. Ideally, there 
would be no storage fragmentation. (See Fig. 1.) 

Dynamic real memory management— OS/360 

The several versions of OS/360 provide many good exam- 
ples of the progress made in providing generalized aids 
for the memory management problem in the real memory 
operating systems. Prior to DOS/360 and OS/360 magnet- 
ic tape was the principal medium for secondary storage, 
but its sequential nature limited its possibilities for memo- 
ry management. The arrival of disk storage as an econom- 
ical second-level storage with good random access capa- 
bilities was the catalyst for a new approach to memory 
management. Central to this was the concept of a pro- 
gram as a set of separate pieces each of which could be 
brought, when needed, from secondary to main storage. 

Some of the earliest programming aids were based on 
the concept of forming a program by binding together 
separately named pieces of source code and data areas 
which were drawn from libraries. These libraries were 
later pre-compiled, which not only saved the compilation 
step for standard modules but also allowed deferring until 
load time the selection of specific standard modules to 
use for a particular execution. In the design of OS/360 it 
was decided not only to support compile-time and load- 
time binding but, by taking advantage of random access 
disk storage, to provide execution-time binding as well, 
in OS/360 a set of services provide the link between the 
application program and the program modules on direct 
access storage. For example, during execution, a pro- 
gram can request the appropriate subroutine; only then 
will it be brought from secondary storage. This defers 
main memory occupancy until it is actually needed and 
also allows the selection of the subroutine to be based on 
the data at hand. Execution-time binding is a particularly 
important feature of OS/360 for two reasons: first, be- 
cause this capability offers the potential of reducing the 
maximum amount of storage required by a program 
during its execution without the preplanning becoming 
hopelessly complex, and second, because the operating 
system itself can use it to great advantage. This is 
because, just as with the application program, the operat- 
ing system requires subroutines and data areas. 

It is fundamental to the design of OS/360 that the 
management of the main memory requirements of the 
system, from the application program right down to the 
innermost services of the operating system itself, had to 
be done consistently. To achieve this, OS/360 was de- 
signed to include a single set of services which allocate 



Figure 1 A scheduled job period in project STEM; jobs were 
scheduled in their alphabetical order. 


main memory to a program on demand, take it back when 
no longer required, and do all the required housekeeping. 
These services had to maintain lists of free and occupied 
space and to associate all occupied space with the appro- 
priate program. Efficient algorithms were devised to 
make free space available in response to specific re- 
quests, including the fitting and grouping necessary to 
minimize fragmentation. These routines, called oetmain 
and FREEMAIN, are available to all programs within the 
operating system. 

Before proceeding further with the description of mem- 
ory management in OS/360, let us refine slightly this 
concept of a program as a set of pieces of code/data/ 
memory. The application program is made up of a "main- 
line” program and subroutines, some of which are specif- 
ic to the application and some of which come from 
system-wide libraries. In addition, the program calls upon 
system services (such as access to the printer). This is 
shown schematically in Fig. 2. The important points to be 
made are that these pieces are not necessarily in main 
memory at the same time, that they may be shared by 
several programs, and' that some of them were not called 
for directly by the program but brought in on behalf of it 
by a system service. The possibilities are almost endless, 
particularly if several programs are sharing subroutines. 

Thus the program can be thought of as a network of 

named pieces. At any one time some of these pieces are in 
main memory, forming, in effect, a subnet of the program 
network. The memory management part of the operating 
system must keep track of this subnet and provide enough 
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Figure 2 Example of an application program composed of a 
main-line program, subroutines from various libraries, and over- 
lays. 


information so that as program execution progresses the 
proper program subnet is always in memory. Other 
memory management functions include such things as 
making sure that, when a job is completed, all memory 
pieces have been returned and as identifying portions to 
reclaim if the program is abruptly stopped. 

Before disk storage was introduced and the overlay 
structure of a program was stored on tape, program 
execution tended to follow the sequence of segments as 
they appeared on the tape. OS/360 took advantage of disk 
storage devices to provide a random access program 
library and, thereby, to provide a very general implemen- 
tation of the overlay technique. Segments could be called 
and recalled in pretty much any sequence. 

As discussed above, an important design objective of a 
memory overlay system is that the program modules need 
not be recompiled to reflect the overlay segment struc- 
ture. That is, we write and compile modules— which call 
other modules as necessary. Next we form the compiled 
modules into segments. In OS/360, the “program” which 
manipulated program modules to form segments was 
called the linkage editor. It resolved the intra-segment 
calls between modules; the inter-segment calls were 
converted into calls to the overlay supervisor to bring 
another segment into memory. Several segments could be 
496 combined to form a larger segment, or a segment could be 
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broken into several smaller segments (provided, of 
course, that it was composed of several modules). In 
addition to providing these means of manipulating seg- 
ments and modules, the linkage editor also provided aid 
in forming a correct structure by automatically position- 
ing common data areas at the proper place in the memory 
structure so as to be addressable by all modules requiring 
access to them. 

As flexible and general as this implementation is 
however, an overlay structure is not sufficient to support 
the execution dynamics of some programs. The overlay is 
based on a preplanned arrangement of programs into 
segments, and some programs, particularly those that 
serve terminal users, do not admit to such preplanning. 

OS/360 also provided facilities for more dynamic as- 
signment of programs to memory by supporting three 
types of supervisor-assisted linkages. Each is based on a 
different assumption about the memory residence of the 
calling and called program. One service, link, is a! 
dynamic subroutine call for which both the called and 
calling programs are to be in memory at the same time. A 
second service, called Transfer Control (xctl), allows'! 
program to call another while relinquishing the storage 
held by the caller. A third service, attach, allows a 
program to establish the execution of another program 
concurrently with its own execution. This permits the 
application program to establish its own multitasking 
environment. All of these facilities eased the managemen 
of the linear nature of memory and scheduling its coii-; 
tents, and they took over much of the housekeeping that 
would otherwise have to be part of the application 


A final aspect of memory management in OS/360 is the 
device independence of the program libraries. Subpro- 
grams could reside permanently on any device in the 
memory subsystem— including main memory. This al- 
lowed improved system performance by placing frequent- 
ly used routines in main memory: No input-output activi- 
ty would be necessary to make these subprograms avail 
able to the caller. This area of memory is called the Li 
Pack Area. 


1 


i 


The evolution through the various versions of OS/360 
and OS/370 can be viewed as an evolution in memory 
management strategies. The principal design objective of 
OS/360 was, from the first, that it be a generalized 
multiprogramming operating system. Early developments - 
in its evolution were aimed at meeting this objective in the 
best ways possible consistent with schedule and resource* 
constraints. Later, on-line environments, such as telepr 
cessing applications and timesharing, were added to the 
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design objectives. These new objectives required exten- 
sion to the memory management services of the system. 

The first version of OS/360 was made available in 1966. 
Called the Primary Control Program (PCP), it was a single 
job-at-a-time operating system. MFT (Multiprogramming 
with a Fixed number of Tasks) was released the following 
year in order to provide a minimum multiprogramming 
facility. The limited capability of these early versions of 
OS was to a large extent the result of limited memory 
management. The next major evolutionary step-MVT 
(Multiprogramming with a Variable number of Tasks) 
was to provide for variably sized job regions. In MVT, 
the memory requirements for the job or job step were 
specified in job control statements. The scheduler re- 
quested the necessary memory to schedule a job and, if 
not enough were available, the job was enqueued, waiting 
for the resource. When memory became available, the 
scheduler would be started in this dynamically assigned 
space. On an installation level, care had to be exercised 
not only over the region size requested but also over 
whether these requests were made for an entire job or a 
job step in order to avoid serious fragmentation effects. 
And the application programmer still had to be generally 
aware of the maximum addressable storage required by 
his program and the system services that were used by it. 

MFT-1I was introduced in 1968 to provide generalized 
multiprogramming on computers that were too small to 
run MVT efficiently. MFT-1I provided several fixed parti- 
tions into which jobs could be scheduled dynamically. It 
allowed for small partitions which were not large enough 
to contain the scheduler code itself. These were sched- 
uled from larger partitions when one of them was free 
(between job steps). This could result in some inefficiency 
but did provide a smaller system than the more elaborate 
MVT system. By way of contrast, to schedule a small job 
in MVT, the scheduler obtained enough space for itself 
and, when it transferred control to the job it had sched- 
uled. the extra memory it had required for itself was 
released for re-use by the system. 

The next major development was in support of the 
time-shared use of memory for the Time Sharing Option 
(TSO) of OS/360. Since it was impossibly inefficient to 
dedicate physical storage to a program interacting with a 
user at a terminal, the function of swapping was added to 
the system. This allowed OS/360 to time-share a portion 
of addressable memory among multiple users by keeping 
on external storage the contents of that portion of memo- 
ry assigned for each user and bringing it back in only 
when that user was allowed use of the time-shared region. 
It was also possible to have more than one such region. 
However, as a consequence of the binding of programs to 


real addresses, each terminal user was locked into the 
region to which he was initially assigned, regardless of 
how the time-sharing user load shifted among regions 
over time. 

Virtual storage memory management 

Recall from the previous discussion the notion of address- 
as-location and address-as-identifier. This distinction, 
and the hardware that supports it, has given rise to 
address relocation, dynamic address translation, paging, 
and virtual storage [5, 6], In all of these a distinction is 
made between what we call a “logical” (or “virtual”) and 
a “real" address. Logical addresses are those defined by 
(and in some sense defining) the program to be run, and 
real addresses are those defined'by (and defining) physi- 
cal storage. The translation of logical addresses into real 
addresses ("dynamic address translation” or "reloca- 
tion”) is performed by the CPU as data in memory are 
needed. This translation (or mapping) can take any of 
several forms. 

The simplest mapping function (called single register 
relocate) adds a constant to the logical address and 
compares the sum to some limit. This map allows several 
logical address spaces to exist in main memory at once, 
but it has the limitation that the maximum logical address 
can be no larger than the maximum physical address. 

This simple address translation technique was part of 
the special modifications made to the IBM 7094 for MIT s 
Compatible Time Sharing System [7], (See subsequent 
section “Early time-sharing and the MIT-RPQ.”) It was 
also incorporated into both the Models 135 and 145 of 
System/370 [8] to support a logical address space for 
DOS/360 to co-exist with another system. 

The ATLAS project provided a more elaborate map- 
ping system. In ATLAS, the main memory of 16K words 
was divided into 64 blocks of 256 words each. Associated 
with each block was a “Page Address Register, which 
contained the virtual address of the 256-word portion of 
virtual memory that was in that block of main memory. 
Virtual addresses were translated to real addresses by 
taking the page address part of the address (that is, all but 
the least significant 8 bits) and comparing it with the 
contents of all 64 Page Address Registers. If a match was 
made, say in register no. 9, then the ninth page of mam 
memory must have contained the data being requested. If 
no match was found, then that virtual address was not in 
main memory and had to be brought in from secondary 
Storage. - ~~ -r— r-- ‘ ~ — 

ATLAS allowed several logical address spaces to re- 
side concurrently in main memory. It also allowed por- 
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Figure 3 The virtual-to-real address translation used in the 
M44. 


tions of virtual memory to be absent from main memory. 
This aspect of dynamic address translation gives rise to 
“demand-paging.” When an absent page of virtual mem- 
ory is referred to, a “page fault” occurs and the memory 
management part of the system must move the requested 
page of virtual memory to main memory and update the 
mapping hardware to reflect this change. Address transla- 
tion hardware can be used to support paging only if a page 
fault is detected early enough in the instruction execution 
cycle, as illustrated later in the case of the Virtual 
Machine Control Program, CP/40. If the page fault occurs 
after instruction execution has progressed too far to 
“back up” (or if the machine is not organized so that 
instruction execution can be stopped and restarted in 
mid-cycle), then demand-paging is not possible. This is 
not to say that address translation cannot be employed, 
only that the memory management part of the operating 
system must be capable of enough “pre-planning” to 
ensure that page faults will not happen. 

In the following sections we discuss how this dynamic 
address translation became generalized into paging sys- 
tems, and many other memory management tools. The 
central idea is that address translation allows system 
cost /performance design problems to be compartmental- 
ized. Thus, broadly viewed, the problem programmer 
manages things (program and data) in a space of logical 
addresses. The operating system manages the real storage 
of the computer by maintaining the mapping function. A 
characteristic of a computer system, including its operat- 
ing system, is the extent to which these separate problem 
areas can be handled separately and concurrently. 

Early time-sharing and the MIT-RPQ 

Starting in the 1940s, with its development of the 
WHIRLWIND computer, MIT was involved in the devel- 
498 opment of computers and significant extensions to com- 


puter system organization. In the late 1950s the notion of 
interactive access (or time-sharing) was being discussed 
[9], and by the early 1960s MIT had evolved a design fora 
Compatible Time-Sharing System (CTSS). The goal set 
for CTSS [7] was: ”... drastically to increase the rate of 
interaction between the programmer and the computer 
without large economic loss and also to make each 
interaction more meaningful by extensive and complex / 
system programming to assist in the man-computer com- 
munication.” Given the great discrepancy between hu- V? 
man and machine processing speeds, it was felt that the 
program material for a given on-line user need appear in -f 
the execution memory only burst-wise, and between such 
bursts human thinking would take place. Memory man- jl 
agement was central to realizing this goal. 

I 

IBM worked with the MIT staff to modify their 7090/94 $ 
processor. The 7090/94 was limited to 32K 36-bit words of 7 
memory, which, though adequate for many user prob- 
lems, was not enough for several active user programs ‘i 
and the sort of system support that the staff at MIT.,i 
envisioned. Additional memory as well as a means of . 
swapping the active user's program into main memory for j, 
its "burst” of execution were required. IBM also worked v 
with MIT to devise several extensions to the 7090 to allow 
it to support interactive access and swapping of programs y 
into (and out of) main memory. Thus, in addition to a •' 
second main memory unit (32K words), address reloca-J 
tion, checking, and protection hardware were developed -i! 
and delivery to MIT was completed by 1963. 

The additional memory unit, termed "core-B," was 
added to the 7090/94 in such a way that the processor 
executed out of either core-A or core-B. The CTSS 
operating system resided in core-A, and user programs , 
were moved into and out of core-B. In order to allow 
several users to co-reside without interfering with each 
other, memory was "protected" in units of 256-word 
blocks. CTSS services allowed the user programs to set 
their address space “size" of up to 32K in units of 256 
words. During the bursts of execution, user address space 
from location 0 to "size” (/V»256)-l was in real memory ( 
at locations 0 to (/V* 256)- 1 in core-B. Locations at and 
greater than (N»256) were protected by the hardware and 
thus could hold the "upper” portions of the address.; 
spaces of other users. This came to be called an "onion-, I 
skin" algorithm. 


The M44 system 

Between 1961 and 1964 the IBM Thomas J. Watson 
Research Center at Yorktown Heights, NY, undertook a 
systematic study of the memory usage patterns of 7090 
programs [10]. The intent of this work was to learn about 
the behavior of these programs in a paging environment. | 
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In order to study a program one must first interpretively 
execute the program and then store on magnetic tape the 
sequence of memory addresses it just generated. The 
resulting "address traces" can subsequently be used as 
input to a paging simulator program. To test program 
behavior in a paging environment, the simulator in fact 
allowed page size, real memory size, and page replace- 
ment algorithm to be varied. As an example, for different 
replacement algorithms, the number of page replace- 
ments which had to take place in main memory to run a 
program were counted and displayed as a function of page 
Size and (restricted) memory size supplied. The details of 
these studies can be found in [11]. 

These studies created a wealth of insight and, inciden- 
tally. led to discovery of the principle of locality [11], 
Nevertheless, simulation could not be the whole story, 
and by 1964 it became clear that full-scale experimenta- 
tion was necessary to test the relation of memory man- 
agement to some of the ideas which were emerging at that 
time- time sharing, multiprogramming, operating system 
structures, simulated or "virtual” machines. An IBM 
7044 was chosen as the experimental vehicle. It was 
modified to support not one but many alternative designs 
by allowing variable parameter values for page size, 
memory size, level of multiprogramming, etc. 

In order to modify the 7044 into what later became 
known as the M44. the address field had to be extended, 
in all instructions with such a field, from 15 bits to 21 bits 
in length. This made two million words addressable. Of 
course, the index registers and other hardware involved 
in addressing were also enlarged. Dynamic address trans- 
lation was hardware-aided, using a dedicated "mapping 
memory" of 32K words. The high order position of each 
processor-generated "virtual” address was used to locate 
a word in the mapping memory, where either the “real 
address of the page containing the desired memory cell 
could be found or else a "page exception" was generated, 
i.e . , an entry to system programs which then acted to 
bring in the missing page from some back-up memory (see 
Fig. 3). Actually, the mapping was somewhat more 
complex, due to the experimental nature of the machine. 
For example, a mechanism was included to permit the 
machine to perform with any page size between 256 and 
4096 words that was an integer power of two. 

But what about multiprogramming using multiple ad- 
dress spaces? With large pages, smaller mapping tables 
were needed, and thus only a fraction of the mapping 
memory was utilized. This made possible the storing of 
mapping information about several address spaced or 
virtual machines. Correspondingly, an additional register 
for "user ID" was installed; it was set whenever a new 


user's program was dispatched. Its contents were used as 
an offset in accessing the mapping memory which con- 
tained address translation data for several virtual ma- 
chines, as indicated in Fig. 3. With a 2048-word page size, 
for example, sixteen independent so-called "44X virtual 
machines” could share the physical resources. 


Another useful thing about the M44 hardware was that 
a single 36-bit word in the mapping memory could hold 
much more information than a real page address. Corre- 
spondingly, several bits called status flags were used to 
designate certain pages as read-only, privileged, transfer- 
protected, etc., thus packaging information on capabili- 
ties, protection, and address transformation in a single 
structure and making the operating system simpler. 

The operating system, of course, had to perform many 
other functions, including the management of secondary 
storage devices, such as disks. All these functions were to 
achieve device independence for programmers such that 
they need not manage the details and limitations of 
physical devices. Programmers, for example, could be 
given only the description of a machine with virtual 
memory and other abstract resources. As the programs in 
execution issued requests for the resources, the operating 
system translated these into requests for real devices. In 
experimentation, the M44/44X system verified the useful- 
ness of these ideas [12]. 

But experimentation helped in many other ways. It was 
observed, for example, that under certain circumstances 
a program will run faster with fewer pages assigned to it 
[13], Another discovery was the nonlinear effect of pro- 
gram localities; The decrease in processor idleness is 
generally more abrupt than the increase of assigned 
memory space which caused it. It was indeed experimen- 
tally verified that dynamically varying memory space 
could be superior to static allocation. 

Significant progress in system performance methods 
was also facilitated by experimentation. At one time a 
programmed control loop was added to the operating 
system; it sensed the page traffic intensity between main 
and back-up memories and the fraction of time the 
processor was idle. With both variables high, the number 
of multiprogrammed tasks was reduced by one until 
congestion eased. The rationale was to allow more aver- 
age main memory to accommodate the natural locality o 
each participating program. This dynamic memory-space 
sharing was then monitored graphically to further the 
insights gained by the system designers [14]: 
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of memory. The programmer (or the compiler) could, at 
any point in the program, insert one or the other of two 
special instructions (actually supervisory calls): "named 
page soon needed" or "named page not needed." While 
this violated device independence, the supplied informa- 
tion was potentially very useful to the operating system in 
its effort to maximize throughput. The instructions were, 
incidentally, nut taken as mandatory by the operating 
system: hence the loose coupling. 

After having served as an experimental vehicle for 
more than three years, the M44 system was dismantled in 
1968. following the original plan. 

Virtual machine systems — CP/40, CP/67, and VM/370 

In [%4 the staff at IBM's Cambridge Scientific Center 
undertook a project intended partly to investigate pro- 
gram and machine structures for an interactive system 
[15]: "Central to the idea of this system is the concept of 
the virtual machine' and. in our case, the 'virtual 360.' " 
This work led first, in 1966, to a virtual Machine Control 
Program (CP/40). This served as a prototype, demonstrat- 
ing feasibility, and provided the basis for CP 67, an 
operating system for the System/360 Model 67. and in 
1972. VM/370. an operating system for the System/370 
machines. The history of the development of this family 
of operating systems is treated separately in this issue in 
the paper by R. S. Creasy [16]. 

Of interest here is the dynamic address translation 
technique implemented experimentally in 1965 on a Sys- 
lem/360 Model 40 [17], Resembling closely that devel- 
oped by (he ATLAS group, the mapping system was quite 
simple. For each 4K-byte page frame of real main memo- 
ry. there was a 16-bit register which contained the identi- 
fier of the page of virtual memory that was. at that 
moment, in that page frame. Thus, for the 256K-byte 
memory of the Model 40, there were 64 registers. Each 
page of virtual memory was "identified'' by a 4 bit userid 
and the 6 bits which identified the page of the virtual 
address. Only 6 bits were required because, although 
System/360 memory addresses are, architecturally. 24 
bits, just 18 bits were implemented on the Model 40. That 
is to say, the Model 40 provided a maximum address 
space of 256K bytes. During execution, CP/40 loaded the 
user-id register with the id of the current user and thus 
specified which of 16 virtual address spaces was to be 
active. Each time the processor requested data, the 6-bit 
page number and the user id were combined to form a 10- 
bit identifier of that page of virtual address space. 

To translate an address, the Model 40 compared the 
page identifier part of the virtual address with the con- 
tents of the first 10 bits of each of the 64 identifier 


- Jv 

registers. (The remaining 6 bits were used to reflect such ’''' 
things as whether the page had been referenced or v 
changed.) If a match was found, then the requested data' '" 
were located in the corresponding page of main memory. "*7 
If no match was found, then the requested page was not in 'r 
main memory. The address translation hardware interne 
rupted the processor so that the data could be transferred 
from disk to some page of main memory. 

To support a demand paging system, the Model 40 had ' 
to be further modified to detect page faults before instruct 
tion execution was started, since many System/360 in-.'i- 
structions cannot be interrupted (or backed-up). The "C 
instruction and address structure of System/360 leads toa 
situation in which, as a maximum case, data from eight ? | 
pages are required for execution of a single instruction. 'V 

The simultaneous comparison of the virtual address^? 
(identifier) with that in the 64 identifier registers was . 
performed by an associative memory system [18], which jS- 
was fast enough so that the execution speed of the Model 
40 was not reduced due to address translation. 

In addition to its importance as a prototype for two §7 
IBM products (CP/67 and VM/370), CP/40 served as a 
vehicle for experimentation on the performance aspects® 
of memory management and paging systems. One topic of S 

study was the impact on paging performance of program 
structure [19, 20]. This work led to techniques for arrang-' 
ing code modules in virtual memory so as to minimize thelSS 
number of page faults during execution. Other results 
were obtained, c.g., those from a multifactor paging ** 
experiment (in which replacement algorithms, load se- . 
quence of subroutines, set of problem programs, and r ji 
main memory were investigated) and from analysis of free (f 
storage algorithms. These are reported in [21]. 

Time sharing system — TSS/360 

The TSS/360 operating system [22, 23] was IBM’s first 
offering with virtual memory, aided by the hardware 
address transformation scheme of the IBM 360 Model 67 m 
computer. TSS/360 was developed in the 1965-67 period, 73 
and many of its memory management features became 
similar to those of the M44. We therefore restrict our- 
selves here to outlining the differences only. 

First, the dynamic address transformation was done in 
two stages, since the 24-bit (virtual) address field was 
subdivided into three subfields, containing segment, 
page, and line numbers. The 4-bit segment field was first 
used as an offset in the so-called segment table, where the 
address of the particular page table describing the pages 
of the addressed segment was found. The (middle) page 
subfield was then used to locate the corresponding real 
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naee . Within which the last 12 bits of the original (virtual) 
address were used for locating the desired byte. (See 

Fig- 4 > 

This hierarchical address organization made the total 
amount of mapping information to be stored smaller than 
the M 44 ’s one-step map; this was necessary because 
segment and page tables, without a dedicated memory, 
had to share the main memory with all other processor- 
accessible information. On the other hand, address map- 
pino became slower since two, instead of one, additional 
memory cycles were needed to find the real address. 

To improve the situation. G. A. Blauuw. at that time 
with IBM, invented a "black box" that could store in its 
content-addressable memory the mapping information 
(virtual-real page association) of several most recently 
used address associations. (M44 experimental data were 
used in the design and feasibility studies of the Blauuw 
box ) Each time a virtual address was generated, this box 
was interrogated and, more often than not, the associated 
real page number found, resulting in a tenfold speed-up 
with respect to in-memory two-stage transformation. 

A further complication was introduced by a fast drum 
device which, as back-up, was too small to support the 
many virtual address spaces, each 16 megabytes long. 
The result was a three-level memory hierarchy, consist- 
ing of main memory, drum, and disk of essentially 

unlimited capacity. 

The memory management functions of TSS thus be- 
came quite elaborate. Many innovative algorithms were 
designed and implemented to cope with the competition 
between page tables and pages containing programs and 
data. Sharing read-only information was implemented 
without duplicating pages, by mapping segments of differ- 
ent address spaces into the same pages in memory. (Each 
address space was subdivided into 16 equal-sized seg- 
ments.) Also, adaptive algorithms decided about the 
"page-out" target area-drum or disk-depending upon 
scheduling status, memory demand, current space occu- 
pied, and other factors. And for cases when available 
drum space became short, decision-making rules about 
replacement, quite similar to those between mam and 
simple back-up memory, were incorporated into the 
operating system. The subsequent migration process 
was tied to the time slice allotments controlled by the 
scheduler. 


The particular arrangement of records on drum was 
such that high transfer rates could be achieved onlyby 
combining eight (or nine) pages on a single track. This 
made necessary the introduction of "prepaging, i.e. , the 



Figure 4 The address translation technique used by the Sys- 
tem/360 Model 67. 


loading of several pages into main memory prior to the 
beginning of a time slice-a slight departure from pure 
"demand" paging. This made memory management even 
more closely coupled to the scheduling of the time-shared 
and multiprogrammed tasks. 

Multiprogramming demanded the sharing of main mem- 
ory. This was done by statically assigning areas of 
different sizes for each time slice of a task, estimated from 
the demand which had been recorded during previous 
time slices. Later this scheme was somewhat relaxed by 
"page stealing," i.e . , the reassignment of a page from one 
task to another. And if there was in memory some 
residual page left from a previous time slice, the system 
was programmed to reclaim it. 

In summary, TSS was the first comprehensive integrat- 
ed operating system built for the computers in the 360 and 
370 line which had dynamic address translation. It helped 
gather valuable experience for follow-on operating sys- 
tems with virtual memories in the early 70s. 

I 

Multiple Virtual Storage (MVS) 

This then was the situation in the early 70s. The retd 
memory-based systems were in extensive use and offered 
a great range of memory management tools which, m the 
main met the objective of high processor efficiency. 
However, the human effort-of both programmer and 
system support staff-needed" to manage memory was 
very high and represented a barrier to adding new appli- 
cations to the computer. In short, programmer productiv- 
ity had to be improved. Some years earlier, the direction 
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!>el for the virtual storage-based systems had been pro- 
grammer productivity, and the success of this approach 
demonstrated by the M/44, CP/40, TSS/360, and CP/67. 
Thus, in 1972 IBM introduced virtual storage on all of its 
processors and their operating systems. An overriding 
consideration in doing this was to minimize disruption to 
customers: Their old code must continue to run without 
change (to preserve their investment), and they must not 
have to accommodate a new operating system structure. 
The Virtual storage design approach taken was substan- 
tially that taken by the System/360 Model 67: an address 
translation extension to the System/360 architecture, 
rather than a change to that architecture. It did not affect 
any of the instructions used by application programs, and 
most of the instructions used by the operating systems 
were unchanged. This allowed the operating systems for 
System/360 to be adapted to virtual storage by adding the 
memory management functions necessary to create and 
manage a single virtual addressable space (of up to 16 
million bytes), running the operating system in that space, 
and. by demand paging, bringing only the necessary 
portions of that space into real memory. Though an 
oversimplification, the initial versions of these operating 
systems appeared to provide a 16-megabyte main memo- 
ry, when in fact the real memory was, say, only one 
megabyte. 

The problem that this approach presented was that 
acceptable performance made it mandatory that portions 
of the operating system code and data not be demand- 
paged. It was necessary to locate these “unpageable” 
portions and organize them into pages; the virtual storage 



in which the operating system ran could then be catego. 
rized as either nonpageable (<>. , fixed in real memory) or 
pageable. ’ r 

In this straightforward manner, the real memory-baseaj 
operating systems were adapted to virtual storage. DOS 
became DOS/VS, MFT became VS-I, and MVT becan 
VS-2. Each achieved its objective of improving applies 
tion programmer productivity by allowing much larged 
regions in which to place programs, with the result that 
overlay structures became simpler or, quite often, unnecS 
essary. In addition, each supported greater levels of 
multiprogramming, and many system functions otherwise 
available only on large machines were available on small- ’’ 
er ones as well. Still the maximum addressable space was ’ 
limited to 16 megabytes and had to be shared among all 
jobs. The greater exploitation of virtual storage and the' 
restructuring of the operating systems that this entailed - 
were left to subsequent versions. 

As they were the most extensive, we touch briefly on 
the exploitations of virtual storage made in the subset® 
quent versions of VS-2. The major restructuring of VS-2, 5 
called Multiple Virtual Storage (MVS) [24], provided ; 
each user with a 16-megabyte address space. (A batch job" 
or someone doing time-sharing is regarded in this context^ 
to be a user!) Recall the previous discussion of a program ■ 
as a network of named pieces and of memory manage- ^ 
ment keeping track of the subnet residing in main memo- ? 
ry. In the exploitation of virtual storage in MVS, the 
System/370 hardware took over the major part of keeping 1 
track of the resident memory subnet. When a user or ' 
system component is to be given control of the processor, d 
the MVS memory map is established so as to include the ‘ 
virtual pages assigned to that user component. 

Another important aspect of the management of multi- 
pie address spaces by MVS is that user programs were j 
effectively isolated from each other. All user address 
spaces share, at one end, the MVS supervisor and, at the - 
other, a Common Service Area (See Fig. 5). Between V; 
these lie the portions that contain programs and data , 
private to each user. Rea] memory assignment of these ■ 
pieces is by demand paging; therefore, not only can it be 
deferred until needed by the program, it can also be 7 
treated independently of other users. Importantly, much ; 
of this burden is handled by the System/370 hardware. - 
During execution no user can refer to (or store into) any 
other user’s private area. In order for one user address ( 
space to communicate with another, it had to use the fi' 
supervisor services that reside in the areas common to .' 
both. In later versions of MVS a new hardware facility 
allowing authorized direct communication between user 
address spaces was supported. 
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Two more aspects of the exploitation of virtual storage 
by MVS should be mentioned. Some system services 
(E.g. , telecommunications access methods) were assigned 
virtual address spaces. They could then be treated much 
like problem programs, with the programmer productivity 
benefits accrued to IBM's own development process. 
Finally, substantial enhancements were made to the 
resource scheduling and dispatching functions of the 
system. It was here, particularly, that the experience with 
CP/40, M44, TSS/360, and CP/67 was applied. 

Conclusions 

We conclude this review by noting two trends. First we 
note the widespread acceptance both inside and outside 
of IBM of virtual memory. One of two trends now 
apparent is extension of the System/370 architecture so 
that it takes over more of the memory management 
functions. An example of this is the recently announced 
IBM 4300 processor family, which performs much of the 
paging management for DOS/VSE. 

The second trend is to remove from the programmer 
the constraints of the linear nature of the address space. 
This is exemplified by IBM’s System 38 [25], in which the 
addressing structure allows not just an enormous address 
space (48-bit addresses yielding a 281-trillion-byte ad- 
dress space) but is also designed to directly name the 
separate pieces of the program. No longer needed are the 
programs that assign named pieces to their relative posi- 
tions in a linear space. Rather, the name given by the 
programmer to each part of a program serves as its 
address. In this manner, much of the memory manage- 
ment burden shifts to the hardware. 

How does one assess these trends? What can be said is 
that the forces that drive the change will be the shifting 
costs within the total computing system— where ‘‘sys- 
tem'' spans not just the processors and storage devices 
but includes the people who own, program, and operate 
them. It is a recognition of this total system cost which 
has governed IBM's past memory management approach- 
es and will continue to do so into the future. 
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IBM Data Communications: A Quarter Century of 
Evolution and Progress 


This paper describes the evolution of technological development in data communications at IBM . Rather than attempting 
to present a complete history, it emphasizes the changing environment and describes the more significant innovations 
that were incorporated in IBM's line of data communications products. Evolutionary developments in this area are 
traced from point-to-point batch transmission, to on-line batch communications, to interactive systems, and finally to 
networking. Although several aspects are treated, the primary focus of this account is on systems architecture, appli- 
cations, and technology. 


Introduction 

Data processing has revolutionized man's capability to 
calculate and keep records. Advances in technology have 
provided the basis for the most significant societal change 
since the industrial revolution— the information society— 
which has been brought about by combining the dramatic 
technological achievements that have occurred in data 
processing with the rapid technological evolution of tele- 
communications. 

In the last quarter century the application environment 
has changed from one where the telecommunication of 
data was basically a convenience (e.g., replacement of 
mail or couriers to transport input data and reports within 
an enterprise) to one where it has become an integral part 
of the day-to-day operation of business and government. 
In fact, there are many applications where communica- 
tions provide the necessary backbone for implementation 
(e.g. , airline reservations systems, missile command and 
control systems, information retrieval and management 
systems, automated bank teller systems). 

In earlier days, business operations had to be modified 
in order to effectively utilize the available technology. Or- 
ganizations were concerned with the relationship be- 


tween business data flow and the computer system uti- 
lized at that time. In many cases, the technology defined 
how business could process its data. In today’s world, 
technology provides the capability for a broad range of 
users, from very large international corporations to small 
independent businesses, to tailor the technology to their 
business needs rather than the converse. 

This evolution has been due primarily to the significant 
changes offered by technology; not only has the price of 
data processing technology dropped dramatically, but 
system performance capability has shown a dramatic in- 
crease as well. While the cost of data communications has 
not decreased as dramatically, the utilization of new tech- 
nology has contributed greatly to the stability of these 
costs in an inflationary environment. 

i 

Application growth and sophistication have placed 
heavy demands on technology, and the demand for con- 
tinually improved cost/performance is obvious. Of equal 
importance is the increasing demand for significant im- 
provements in system reliability and availability. When 
the base of comparison was the time it took to move data 
from one location to another by mail or by courier, any 
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performance improvement from data transmission was 
welcomed, whereas in today’s environment poor per- 
formance could have serious effects on the overall opera- 
tion of the business. 

In the last quarter century, while the ever-increasing 
use of data communications has revolutionized the data 
processing and information industries, the rate of devel- 
opment of telecommunications technology unique to data 
communications has been more limited. This is due pri- 
marily to the enormous existing base of telephone tech- 
nology designed primarily for voice usage. Today, even 
with the increasing percentage of DP systems employing 
data communications, the amount of data traffic still rep- 
resents a small portion of overall telecommunications us- 
age. although this area is by far the most rapidly growing 
segment of telecommunications. Historically, data com- 
munications capability was provided by enhancing basic 
telephone or telegraph technology: today there are many 
new telecommunications networks being developed 
around the world which are designed exclusively for data 
transmission. These developments will undoubtedly lead 
to an even more explosive growth rate of data transmis- 
sion in the years to come. 

While it is difficult to chart the orderly evolution of a 
systems technology as multifaceted as data communica- 
tions. the approach chosen here is to highlight the key 
elements of application and technology evolution. Data 
communications are traced from their beginnings in the 
early off-line point-to-point systems, through the evolu- 
tion of batch, interactive, and job-oriented terminals and 
systems, to today’s distributed processing environment. 
The evolution of systems architecture, networking, and 
software development are highlighted because of their 
critical contribution to the advancement of the overall 
technology and importance for the future. 

Point-to-point batch transmission of data 

The genesis of data communications can be traced to the 
first attempt to move data from a remote site into a com- 
puter center. From the very first, there were the problems 
of converting “languages" and of matching speeds be- 
tween two dissimilar media, and these have remained im- 
portant considerations throughout the history of data 
communications. 

Prior to 1954, although most data were batch processed 
on punched-card-oriented equipment, telegraphic com- 
munication services were oriented toward paper tape. 
Therefore, a media incompatibility problem existed: Con- 
version from cards to paper tape was required at a remote 
site and from paper tape to cards at a central processing 
392 site. A significant innovation took place in 1940 when 
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IBM delivered a tape-to-card conversion machine to the 
U.S. Air Force. This machine made it possible to enter 
data from a remote site via teleprinter equipment, receive 
the data at the central site via paper tape, and then auto- 
matically convert to punched cards for processing. A year 
later, a card-to-paper-tape machine was developed for 
use at the remote sites and, for the first time, card-to-card 
transmission— without re-keying of the data a second 
time— was possible [1]. 

In order to eliminate the need for costly card-to-tape 
and tape-to-card conversions and to be able to transmit 
data more accurately and at higher speeds, the IBM Data 
Transceiver was developed and was announced in May 
1954. This unit allowed direct card-to-card transmission 
and incorporated the following significant innovations: 

• It was the first data terminal to use punched card media. 

• It was the first commercially available data communica- 
tions terminal that could operate over leased telephone 
circuits. It also operated over telegraph and radio chan- 
nels. 

• The serial card I/O equipment was limited in speed to 17 
columns per second, which was significantly less than 
the transmission capacity of a voice-grade channel. The 
Data Transceiver was the first data terminal to ef- 
fectively utilize the bandwidth available on a telephone 
circuit by partitioning the frequency spectrum into four 
subchannels (channelization). Through the use of plug- 
gable band-pass filters spaced 500 Hz apart, the user 
could operate up to four terminals simultaneously on a 
single voice-grade channel. 

• It was the first data terminal to have a modem, in- 
tegrated within the product, capable of operating over a 
leased telephone circuit. 

• A unique 4-out-of-8-bit. fixed-count code was used in 
the Data Transceiver which permitted vertical checking 
since each code pattern transmitted had to have four 
one bits. (Telegraph transmission was based on the 
five-element Baudot code, which was unchecked for 
transmission accuracy.) 

• With channelization of a leased voice circuit, up to 68 
characters per second (cps) could be transmitted versus 
6 to 10 cps on a telegraph channel. 



i 


With the expanded use of commercial computers in the 
mid-1950s, it became desirable not only to increase the 
speed of batch data terminals but also to transmit other 
forms of media such as magnetic tape. In 1960, IBM an- 
nounced Synchronous Transmitter Receiver (STR), a 
new form of synchronous line control that was used in a ~ 
new family of terminals. 

Heretofore, with asynchronous transmission, addi- 
tional bits had to be transmitted with each character to 
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define the character boundaries for synchronization. 
Maximum line efficiency is a function of the number of 
bits representing a character and the additional start/stop 
bits for each character. With STR, character synchro- 
nization was achieved by sending a synchronization pat- 
tern once per block of characters rather than for each in- 
dividual character, thus increasing the line efficiency. 

The first STR product transmitted from magnetic tape 
to magnetic tape. (It took seven hours to transmit a full 
reel of tape between two IBM 7701 tape drives!) Later 
versions used other media, permitting transmission in 
various combinations such as card to magnetic tape, card 
to card, paper tape to paper tape, card to computer, etc. 
Several specially designed STR devices were also suc- 
cessful. such as the IBM 1974, which combined an STR 
with a 1440 computer to become IBM's first programmed 
data communications product. STR operated over dial or 
leased telephone facilities. Like the Data Transceiver, 
STR was based on a 4-out-of-8 code that permitted trans- 
mission of up to 70 different code combinations. It also 
added a longitudinal parity check for improved accuracy, 
especially when using longer block lengths. The primary 
significance of STR was that it was the first broadly de- 
fined discipline for synchronous data communications be- 
tween two devices. It not only permitted faster transmis- 
sion over telephone facilities [1200 bits per second (bps) 
at time of announcement] but also improved line effi- 
ciency by using synchronous rather than asynchronous 
control. It also provided for error checking and in- 
troduced automatic retransmission of blocks containing 
errors. 

For specialized applications, STR's speed was ex- 
tended to operate at up to 5100 cps on 48-kHz broadband 
communications channels in 1962 and later up to speeds 
of 62 500 cps on ®Te!pak [2] D 960-kHz broadband chan- 
nels. 

In the late 1950s the concept of spooling was developed 
as a way of improving overall batch throughput in a com- 
puting system. Spooling permitted copying from rela- 
tively slow input media to an output medium (e.g. , from 
magnetic tape to a printer) while a processor was per- 
forming other applications. With the introduction of Sys- 
tem/360 and its operating system OS/360, all jobs to be 
processed were rerouted through a single job queue. In- 
put readers and output writers were established as de- 
vice-dependent programs to place work in a job queue 
and to send completed jobs to the appropriate output de- 
vice. It was a logical extension of this concept to think of 
an input reader that used a telecommunications access 
method to gather input from data terminals for placement 
in the job queue, and an output writer to cause final out- 
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put to be transmitted to a terminal device. The OS/360 
Remote Job Entry (RJE) program was announced in Au- 
gust 1967. With high-speed batch terminals containing a 
card reader for input and a printer for output, it became 
possible to submit jobs remotely to a System/360 com- 
puter using OS/360 RJE. 

Until 1967, STR was the data link control used by all of 
IBM's high-speed batch terminals. The environment, 
however, was changing: The American Standard Code 
for Information Interchange (ASCII) was adopted as the 
standard interchange code by the American Standards 
Association [3], the System/360 was based on Extended 
Binary Coded Decimal Interchange Code (EBCDIC) 8-bit 
code, and users required the ability to transmit a bit 
stream that was transparent to the transmission system. 
Further, users desired to implement multipoint leased 
communications circuits in addition to point-to-point 
leased and dial facilities. Another key requirement was to 
have transmission control procedures which were code- 
independent; i.e. , logic was built around the definition of 
transmission control functions without regard to their 
specific code implementation. Because of American Na- 
tional Standards Institute (ANSI) activities, transmission 
control functions were defined to be functionally identical 
to the ANSI definitions where possible. A superset of 
functions were defined where equivalent functions did not 
exist in the ANSI definitions. In order to satisfy these re- 
quirements a new byte-oriented line control known as Bi- 
nary Synchronous Communications (BSC) was devel- 
oped. BSC is a general-purpose data-link control that pro- 
vides synchronous communications between a variety of 
terminals and transmission control units over either 
point-to-point or multipoint circuits. Some of the key con- 
trol functions contained in BSC are as follows: 

• Synchronization between the two devices; 

• Establishment procedures— polling, selection, or con- 
tention; 

• Circuit assurance — terminal identification; 

• Message transfer— including delineation and blocking; 

• Error checking— cyclic, polynomial checking or longi- 
tudinal checking, and automatic error recovery; 

• Message acknowledgment; 

• Termination procedures. 

I 

BSC supports IBM’s EBCDIC code, ASCII code, and 
fully transparent text using EBCDIC controls. IBM de- 
veloped BSC in parallel with national standards activities 
that led to the formulation of the ANSI X3.28 standard for 
data link control. v , L 

In January 1967, IBM announced the 2780 Remote Job 
Entry terminal that could transmit programs and data 
from a card reader at 400 cards per minute and print re- 





ports from processed data received by the terminal at 240 
lines per minute. It used BSC half-duplex line control, 
could operate at speeds up to 4800 bps on voice circuits, 
and was equipped with dual buffers so that transmission 
and card I/O and printing operations could be performed 
simultaneously. 

Job-specific and industry-specific terminals 

While early data base/data communications systems used 
general-purpose data terminals, it became evident that 
there were a number of applications that required unique 
terminals to meet the needs of specific growth areas. 

One of the earliest applications identified was factory 
data collection in the manufacture and process industries. 
IBM's activities began in 1951 with a joint study with 
U.S. Steel Corporation. This study led to the announce- 
ment of the IBM 9600 Automatic Production Recording 
System in 1956. Transactions could be entered from ter- 
minal devices located throughout the factory floor, which 
were then transmitted via cable to a central console 
where the data could be punched into cards or printed. 
This system was followed by the IBM 357 Data Collection 
System announced in 1959. The 357 terminals permitted 
workers to record attendance or labor transactions by en- 
tering their badges containing punched holes, or pre- 
punched cards. The transaction data were recorded cen- 
trally onto punched cards. Both the IBM 9600 and IBM 
357 were off-line systems and used multi-wire cable to 
connect the remote terminals to the central recorder. 

The next IBM data collection system was the 1030. an- 
nounced in May 1963. The 1030 system used standard 
two-wire communications facilities connecting remote 
terminal concentrators to the central computing system, 
making it the first on-line data-collection system. In addi- 
tion to operating three times as fast as the 357, it permit- 
ted two-way response from the computer back to the 
workstation. 

The finance industry was attracted to the use of on-line, 
data-communications-based systems to provide better 
service to savings deposit customers. IBM entered into a 
study with the First National Bank of Chicago in 1961 to 
define an on-line savings system, and the IBM 1060 Sys- 
tem was announced in November 1962. The 1062 Teller 
Terminal had the following features: 

• Passbook feeding and posting, 

• Alphanumeric printing. 

• A printed record of all transactions on a journal tape, 

• Two-teller operation with separate net cash accumula- 
tors and locks, and 

• Both on-line and off-line operation for increased relia- 
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In the late 1960s, a higher-function, higher-perform- 
ance system, the IBM 2972/2980, was introduced. Key to 
this system was a new controller that could service up to 
ten teller terminals. The controller communicated with 
the central computer data base over 2400-bps voice cir- 
cuits using BSC data link control. 

It became evident in the early 1970s that advances in 
solid state technology and the rapidly increasing accept- 
ance of on-line, data base/data communications systems 
made it attractive to pursue both other general-purpose 
and industry-specific applications. To provide the data 
terminal subsystems required for these new applications, 
IBM embarked on a program in 1970 to develop a family 
of "building blocks" that could be used in the multitude 
of data terminals and controllers required in the future. 

The key to the building block concept was development 
of a series of upward-compatible microcomputers, known 
as "universal controllers" or UCs. The first UC appeared 
in the IBM 3600 Finance Communication System an- 
nounced in August 1973. Since that time, six different UC 
models involving three generations of technology have 
been available as building blocks. The basic architectural 
philosophy of the UC microcomputer family includes the 
following: eight priority interrupt levels; one set of six- 
teen 32-bit registers per interrupt level; a direct-memory 
addressability to 2' 6 bytes and a virtual addressing capa- 
bility to 2 32 bytes; either direct control or channel control 
for I/O devices; and an instruction set of approximately 
250 operations [4], 

Today, the larger UCs. used in the 8100 family of dis- 
tributed processors, contain more than a megabyte of 
main memory and operate at about 400 000 instructions 
per second. The smaller, single-chip UC, used primarily 
in terminal workstations and I/O adapters, contains a sub- 
set of the UC architecture, typically has I6K to 64K bytes 
of memory and performs about 150 000 instructions per 
second. 

The key concept of the UC family is compatibility, both 
from programming and device-interface standpoints. 
Having a standard device-attachment interface has per- 
mitted IBM to develop a family of standard I/O and com- 
munications adapters as well as the I/O devices them- 
selves. Today’s data terminal developer now has a full 
range of “building block” components from which to 
configure his general-purpose or industry-specific prod- 
uct. These components include keyboards, printers, 
power supplies, diskettes, disk files, card readers and 
punches, CRT and gas-panel displays, as well as the stan- 
dard adapters to provide an interface between these de- 
vices and the microcomputer. A range of standard com- 
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munications adapters are also available to the product de- 
veloper. 

An example of how this concept is helping to satisfy the 
requirements of unique industries is seen in the IBM 3660 
Supermarket System. In this system, a UC-based in-store 
controller is equipped with electronic point-of-sale termi- 
nals that also contain UC microcomputers, displays, 
keyboards, and laser scanners. The purpose of the laser 
scanner is to automatically read the Universal Product 
Code (UPC) [5] imprinted on the label of each item. 
Knowing the code for each item, the system can automat- 
ically look up the price for that item and print it along with 
an alphabetic item description on the sales tape. 

The UPC symbol is a distinctive line pattern that ap- 
pears on all grocery items. The UPC pattern was devel- 
oped by IBM and was submitted to the supermarket in- 
dustry symbol selection committee. IBM also supported 
the industry-wide quantification of symbol print quality. 
The successful development of the UPC led to a proposal 
for a world product code, of which UPC would be a com- 
patible subset. This proposal has had a favorable re- 
sponse in Western European countries as well as in Japan 
and Australia. Many countries have established national 
coding authorities and have instituted manufacturer 
marking of symbols on both food and non-food items. 

The IBM 3687, a second-generation scanning system, 
was announced in November 1980. This system embodies 
the first commercial application of holographic tech- 
nology in IBM. Through the use of a laser and a rotating 
holographic disk, a sophisticated scan pattern is gener- 
ated which wraps around the UPC-coded item as it is 
scanned. This minimizes the need for the clerk to align 
the UPC symbol prior to scanning, thus improving the 
productivity of the checkout operation. 

Modularity and scanning capabilities have also been 
extended to banking and to plant communications sys- 
tems with the introduction of the IBM 3630 system. 

Interactive systems 

Probably the earliest occurrence of a terminal operating 
interactively with a computer over communications facili- 
ties can be traced to a demonstration conducted by Bell 
Laboratories for the American Mathematical Society in 
September 1940 [6], Bell had built a relay computer, 
which was installed in their laboratory in New York City. 
To demonstrate the computer to members of the Mathe- 
matical Society who were meeting in New Hampshire, 
they connected a teleprinter via a telegraph circuit to the 
New York City computer. 


In early 1950, the U.S. Defense Department concluded 
that a new system needed to be developed to protect the 
U.S. from a manned bomber threat. A system called 
SAGE (Semi-Automatic Ground Environment system) 
was defined by M.I.T.’s Lincoln Laboratory [7]. IBM 
was awarded the contract to develop, manufacture, in- 
stall, and maintain the central computer (AN/FSQ-7) and 
the combat control central (AN/FSQ-8). The AN/FSQ-7 
included the communications front end that accepted in- 
formation in real time via attached communications lines 
from tracking devices. Processed data were sent out over 
communications lines to defensive weapon systems. In a 
national network of Q-7s, each computer exchanged data 
with neighboring Q-7 computers and with a high-level Q- 
8. The significance of the SAGE effort was that it demon- 
strated the feasibility of merging a network of computers 
and communications to provide system solutions for com- 
plex, real-time applications [8]. 

In 1954, IBM established a joint research project with 
American Airlines to study their reservations problem 
and to define a communications-based system that would 
satisfy their requirements. A formal proposal was made 
and a contract was signed in November 1959. The major 
improvement in the proposed IBM system, known as 
Semi-Automatic Business-Related Environment (SABRE) 

[8], over other early electronic reservations systems, was 
that it handled not only the seat inventory but the passen- 
ger name records as well. The SABRE system also inter- 
connected to other airlines via telegraphic facilities to au- 
tomatically request space segments on their flights. The 
system, which became fully operational in 1964, con- 
sisted of duplexed processors, files, drums, and commu- 
nications front ends. Over 1100 agent terminals located 
throughout the country were connected to the system by 
nine full-duplex leased voice circuits. The SABRE sys- 
tem, which capitalized on our SAGE experience, con- 
tained the significant innovations described in the follow- 
ing subsections. 

• Line control , 

SABRE line control was developed for use on synchro- 
nous multipoint'leased voice circuits. Hub-go-ahead poll- 
ing was introduced. In this type of polling, the computer 
polled the furthest terminal, which then notified the next 
terminal when it could begin sending, and so on. The key 
advantage of this approach was the capability to poll or 
solicit traffic from a large number of interactive terminals 
on a multi-drop line with minimal overhead, providing an 

excellent response time. Classical polling techniques 

could not have met the response requirements established 
by American Airlines. 

• Remote multiplexers 

The SABRE system contained terminal interchange units 395 
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in 43 locations that interfaced the agents' workstations to 
the high-speed communications facility. When the agents' 
terminals and the terminal interchange unit were located 
in the same building, the terminals were cable-connected. 
Thus the terminal interchange unit acted like a clustered 
terminal controller. When the agents' terminals were re- 
motely located from the terminal interchange unit, they 
were connected by low-speed communications circuits. 
In this case, the terminal interchange unit performed a 
remote multiplexing function. 

• Agents' terminal 

The agents' terminal was the first application of the IBM 
"selectric typewriter printing mechanism as a terminal 
print unit. The application opened up the use of this 
mechanism in many other IBM terminal products and in 
those of other manufacturers. 

• Real-lime operating system 

SABRE had a requirement for a very efficient control pro- 
gram to handle the message throughput and provide the 
terminal response required. 

• Real-time front end 

The IBM 7286 Real Time Channel was developed for the 
SABRE system. This unit attached to the processor and 
had buffers so that a 36-bit word could be assembled from 
each communications line before the information was 
transferred to the processor unit. 

• Line testing project 

When the SABRE project was started, the maximum 
transmission rate was 1200 bps over leased voice circuits. 
Because of a very extensive line testing program IBM did 
in cooperation with AT&T, the rate for reliable transmis- 
sion was raised to 2000 bps and later to 2400 bps on leased 
voice-grade channels. Improvements in signal processing 
have continued and now it is possible to transmit at 
speeds up to 9600 bps on an unconditioned voice channel 
using current modem products. 

During the late 1950s. various government agencies and 
large corporations developed an interest in replacing their 
manual, torn-tape or semi-automatic message-switching 
centers with computer-controlled store-and-forward mes- 
sage-switching systems. The terminals used were largely 
telegraph keyboard printers or paper-tape readers and 
punches. While these terminals were adequate for admin- 
istrative message traffic, they were functionally in- 
adequate for data applications. Thus, in 1963, IBM an- 
nounced the IBM 1050 Data Communication System for 
general-purpose applications. Some of the features and 
innovations of the 1050 include the following. 

• Line control 

Prior to this time, these systems used asynchronous, 
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start-stop line control with teletype line control in which 
the character set was limited to the 5-bit Baudot code. Bit 
and character synchronization was accomplished, char- 
acter-by-character, by detecting the presence of a pulse at 
the start of a character and at the end of the character. 
Error detection was left to visual inspection of the re- 
ceived message. The IBM asynchronous line control per- 
mitted operation with both point-to-point and multi-point 
circuits. It used a new 7-bit. checkable code known as 
PTTC/BCD that permitted handling larger character sets 
than did the Baudot code. The new line control improved 
transmission efficiency by eliminating the FIGS and 
LTRS codes required in Baudot code. Both positive and 
negative acknowledgments were provided. Single control 
characters rather than control-character sequences were 
used, and the control structure permitted device selection 
at the receiving terminal. 

• Error retransmission 

The paper tape and card I/O were designed to back up and 
re-transmit the message in case an error was detected by 
the receiving unit. 


The terminal operated in half-duplex mode at 14.8 cps 
rather than at the maximum of 10 cps in teleprinter termi- 
nals. 

• Printer 

The printer used the IBM selectric typing element. This 
device provided superior print quality and lines wider 
than 78 characters. 

In 1965. IBM introduced the 2740/41 communicating 
typewriter terminals. Great care was taken to ensure that 
all of the functional and human factor characteristics 
found in the standard IBM selectric typewriter were re- 
tained. 

These devices, coupled with application software on 
the System/360, permitted a person to interact with a 
processor in a conversational manner. The selectric 
typewriter with its changeable typing element allowed it 
to be used for interactive computing applications such as 
apl, basic, and tso, and also for business applications. 
Because of its high print quality, the 2741 was also used 
as the terminal in text-processing applications. 

Display terminals 

The first IBM display terminal products introduced a new 
dimension to the man-machine interface; to the layman it 
was a window into the CPU, to the customer it offered 
new data entry capability with soft-copy printing, and to 
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those with vision, it was a more efficient linking of man 
and machine and one that possessed enormous potential 
for growth in customer interaction and productivity of 
computer systems. The key was developing the capability 
to enter, display, and manipulate data at terminals and the 
marriage of this capability, through communications facil- 
ities and discipline, with host programming to provide 
new and more efficient system applications. 

The 2250 display family introduced in 1964 established 
capabilities that were to stand for years in high-function 
interactive vector graphics. The 2250 display presented 
an information content of up to 2800 characters or 6000 
inches of vectors in a 1024 x 1024-dot addressable image 
space. 

The history of alphanumeric terminal development was 
auspicious. The 2260/2848 product combined TV tech- 
nology with delay lines for image refresh, to provide an 
information content of 240, 480, or 960 characters repre- 
senting approximately 34 000 dispiayable points per ter- 
minal. By today's standards, terminal functions were 
low: erase-to-end-of-line, tab-to-column, and one shared 
subsystem printer. System communication was at 1200 
and 2400 bps with start-stop protocol. This product laid 
the foundation for future inquiry and data entry appli- 
cations and became the predecessor of the first display 
terminal for airline reservations. 

The 3270 family introduced in 1972 established new 
standards in function, performance, and reliability that 
resulted in user acceptance far in excess of expectations. 
Key functions included the following: 

• The 3270 data stream, formatted for CRT applications. 

• 1920-character screen format of 24 rows by 80 charac- 
ters, providing approximately 121 000 dispiayable 
points. 

• Terminal functions such as character or line insertion or 
deletion, light pen, printer option, automatic-cursor- 
movement capability, data compaction, protected data, 
multi-level brightness, and signature recognition. 

• BSC protocols with data rates to 7200 bps. 

• Improved human factors serviceability and usability. 

• Host DB/DC programming support (CICS/IMS, 
TCAM, VTAM). 

In 1974 the plasma-gas-panel technology was introduced 
in a 480-character display for the 3600 Banking System. 

The next generation of 3270 alphanumeric products 
emerged in 1977 and introduced new functional capabili- 
ties within a framework of improved quality, service- 
ability, and human factors oriented to enhance customer 
productivity. These included 
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• Larger screen formats: 43 rows of 80 characters and 27 
rows of 132 characters, with an information content of 
approximately 360 000 dispiayable points. 

• SDLC communication protocols with signaling rates up 
to 9600 bps. 

• Enhanced character fonts. 

• Extended data stream functions: multiple presentations 
of color, character graphics, text and alphanumeric 
data; growth of device independence at the terminal. 

In 1978, introduction of the 3730 system marked the 
beginning of a text/word-processing application class of 
alphanumeric terminals. 

The 8775 terminal shipped in 1979 introduced use of a 
microcomputer to offer new flexibility in function plus a 
loop-communications attachment to distributed systems. 

In the same year, the 3250, a follow-on product to the 
2250, established new levels of cost/function in support of 
the sophisticated customer base that had developed over 
the years in the application of interactive design graphics. 

In 1980, the 3270 family of alphanumeric products were 
further enhanced by two major new terminal- and host- 
supported functions: 

• Low-cost presentation graphics using alphanumeric 
techniques which displayed over 371 000 points. 

• High-resolution seven-color displays. 

The consequence of these products is that displays 
have achieved the most prolific growth and dominant role 
of any interactive system terminal; it is a chronology of 
success. 

Network architecture and control 

As data communications networks became more com- 
plex, the interaction among their many parts became 
more critical. To control the Bow of information between 
the terminal and the application program requires coordi- 
nation among the terminal design characteristics, the ap- 
plication requirements, the operating system of the main 
computer systems, and the people who interact with the 
information (both users at the terminals and specialists 
coordinating the network operation). Among the key 
questions influencing this coordination are the following: 

• Is direct message control or a queued system needed? 

• Is the terminal to be associated with only one appli- 

cation or should an indefinite number of applications be 
accessible from the terminal? —y~r — — 

• Do the programs and data all reside in one computer 
complex or are they distributed among several? 

• What are the network size, performance, and reliability 

requirements? 397 




The possibilities in each of these areas have evolved 
substantially since the 1950s when the first systems were 
being developed. 

The development of general-purpose communications 
controllers was preceded by the introduction of special- 
purpose controllers. Special-purpose communications 
controllers of the 1950s were tightly coupled into systems 
such as the SAGE national defense system and the IBM 
7281 Real-Time Channel, which was part of the Project 
Mercury man-in-space system. Other special products of 
the early 1960s. like the IBM 7286 Real-Time Channel, 
were designed for large, complex, and special appli- 
cations and found use within large enterprises such as the 
Project MAC time-sharing experiments at M.I.T.'s Lin- 
coln Laboratory and the passenger reservations system 
tor American Airlines. These systems demonstrated the 
feasibility of interconnecting large networks in com- 
mercial applications. 

The first general-purpose communications controller 
IBM developed was the IBM 7750 Programmed Trans- 
mission Control Unit, a stored-program front end that 
could attach to the IBM 1410/7000-series systems. Some- 
time later it was felt there was a need for a stand-alone 
communications processor for store-and-forward mes- 
sage-switching applications. The IBM 7740 was an- 
nounced in June 1963 to serve this requirement. While it 
had lower line-handling capacity than the 7750. it had sig- 
nificantly greater throughput capability and attached disk 
files. 

The introduction of System/360 in 1964 helped make 
teleprocessing cost-effective for a much wider scope of 
business enterprises. The IBM 2701, 2702. and 2703 
Transmission Control Units were developed as general- 
purpose "hardwired" machines without programming ca- 
pability and most of the sophisticated control function 
was relegated to the System/360 host computer. 

Access methods 

Another class of control functions, implemented with 
software and known as telecommunications access meth- 
ods [9], were fundamentally of two types: direct control 
and queued. The Queued Telecommunications Access 
Method, QTAM, was developed in 1963. Its major design 
characteristics included compatibility with Operating 
System/360 data-management design and conventions; 
significant improvement in ease of use for large tele- 
processing systems (at that time defined as systems in the 
range of 5-100 lines): and efficient use of low-speed com- 
munications lines (less than 600 bps). CPU time, and 
CPU storage. The three major innovations in the QTAM 
design were the concept of a message control program, 


the use of a nonprocedural language (provided by OS/360 
macros) to generate control program functions, and use of 
an allocating and scheduling mechanism. 


Basic Telecommunications Access Method (BTAM), a 
lower-level access method than QTAM, was designed in 
1964 to provide more immediate or direct control. Its ma- 
jor design characteristics included compatibility with OS/ 
360 and DOS/360 data management design and conven- 
tions, use by OS QTAM, economy of CPU cycles and 
storage, and adaptability for use by higher-level packages 
and for incorporation of additional I/O devices (locally or 
remotely attached). 


BTAM was used more widely than had been originally 
planned. It was used by various higher-level programs 
(e.g. , CICS and IMS), and it supported a large number of 
telecommunications devices and some nontelecommuni- 
cations devices (e.g . , locally attached paper tape). 

By 1966 it was realized that it would be very difficult to 
incorporate BSC support into QTAM because the mix- 
ture of the new higher data rates implied by BSC (up to 
4800 bps) with the existing low data rates of start-stop 
protocols required a new buffer-management design. A 
design study completed in 1968 resulted in the in- 
troduction of TCAM (Telecommunications Access 
Method) as a replacement for QTAM. It was hoped at 
that time that TCAM would be a successful replacement 
not only for QTAM but also for BTAM. 

This hope was not fulfilled because a direct control ac- 
cess method was still needed (TCAM provided a queued 
interface, as had QTAM), but BTAM needed substantial 
improvements to match the technology advances of Sys- 
tem/370 virtual memory. Synchronous Data Link Control 
(SDLC), new programmable communications controllers, 
and the introduction of distributed processing. Virtual 
Telecommunications Access Method (VTAM) was devel- 
oped in 1972 to meet these goals and to be the primary 
access method for IBM’s overall telecommunications ar- 
chitecture plan, Systems Network Architecture (SNA). 


Communications controllers 

Before describing SNA, however, it is important to un- 
derstand the contribution of the communications con- 
troller. Remote terminals interconnected with a centrally 
located computer by telephone lines require a device 
known as a communications controller at the central site 
to prepare, transmit, and receive the data for these termi- 
nals. The technology of the late 1960s established that 
stored-program concepts could be applied to small com- 
puters. In 1972 and 1973, IBM extended these concepts 
into communications controllers with the 3704 and 3705. 
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Almost a decade earlier the 7750 programmable commu- 
nications controller had accommodated the complex tele- 
processing control functions for large and special-purpose 
systems. Now the concept of moving many of these func- 
tions outboard of the System/370 into a communications 
controller was cost-effective for large and small system 
sizes. 

The control program for the 3705, the Network Control 
Program (NCP), off-loads substantial processing from the 
System/370: it performs basic data communications func- 
tions such as routing data traffic towards its destination, 
managing and controlling the flow of data traffic to avoid 
congestion, buffering and queuing of data, scheduling 
transmission line operations (e.g. , sending data and poll- 
ing). performing various line control protocols (e.g., BSC 
and SDLC) and error-recovery procedures, and detecting 
and reporting to its access method irrecoverable errors 
and other exception conditions. 

In addition to introducing improved methods for these 
functions, the control program also is capable of emulat- 
ing the IBM 270X controllers so that applications pre- 
viously written with the BTAM access method will run 
concurrently with NCP and with VTAM- or TCAM-based 
applications. As teleprocessing requirements grew in the 
1970s, these basic communications functions expanded 
into sophisticated networking capabilities managed by 
NCP in the 3705 and VTAM or TCAM in System/370, 
coordinated by SNA. 

Systems Network Architecture (SNA) 

SNA significantly advanced the state of the art in tele- 
processing software systems [10, 1 1]. Announced in Sep- 
tember 1974 as IBM’s blueprint for future teleprocessing 
products, SNA provides a unified design for the functions 
and structure of IBM’s data communications products. It 
allows customers to establish a long-term direction as 
they plan networks of terminals and processors, both 
large and small. In this sense SNA is analogous to Sys- 
tem/360, introduced in 1964 with its uniform architecture 
for processors, which was the response to the earlier 
problems of sharing programs, I/O devices, and other re- 
sources of the central computer complex among the vari- 
ety of IBM 1400- and 7000-series computers. Prior to the 
introduction of SNA, teleprocessing networks were beset 
by many problems: Terminals were often dedicated to the 
use of a single application; numerous and diverse line- 
control procedures and terminal types were ingrained into 
the support programs, application programs, and network 
operations; and multiple access methods were in common 
use, thwarting any attempt to share resources among ap- 
plications. Each of these problems made it difficult to ex- 
pand existing applications or to add new ones. SNA was 


introduced to solve these problems and to make tele- 
processing applications easier to install, operate, and ex- 
pand. 

SNA also had its roots in the hardware technological 
advances of the early 1970s. At that time, it became eco- 
nomically possible to incorporate a small processor into 
the design of many terminals (such as the universal con- 
troller described previously). 

Prior to such microcomputers, a terminal was com- 
manded directly by its host computer. For example, each 
keystroke produced an input character transmitted inde- 
pendently at the rate of generation; and each output char- 
acter was sent at a rate not exceeding that of the printer. 
Commands such as “shift to upper case” or “carriage 
return” were handled by the host on an individual basis. 
Thus, the host computer was intimately associated with 
the detailed characteristics of each device attached to it; 
and, as the terminal network expanded, the workload 
placed upon it could become excessive. 

With the new microcomputer-based designs, the pro- 
cessor within the terminal handles many functions inde- 
pendently of the host, and the transmissions between host 
and terminal are complete messages sent at high speed. 
This reduces the processing power required at the host 
and/or allows more terminals for a host of the same size. 
A more important change, however, was in system struc- 
ture. No longer is a tight coupling between terminal and 
host needed; device control now can be placed at or near 
the end terminal and not in the host. Thus, system com- 
mands, protocols, and procedures designed for tight cou- 
pling are no longer required; instead, a new set specifi- 
cally designed for distributed processing is required. 

Just as the processor in the terminal now handles de- 
vice control, it also readily becomes an application pro- 
cessor. A simple example is found in a retail store: The 
point-of-sale terminal and an in-store controller combine 
to off-load the host by totaling individual purchases, com- 
puting tax and {change, processing credit verification, 
maintaining inventory files, and preparing summary man- 
agement reports. Again, the important point, from a sys- 
tem standpoint!, is that the application may now be per- 
formed in any of several places within a network— at the 
host, at a controller, or even at the terminal itself. 

This is a new structure that essentially did not exist be- 
fore, and definitions for the control of such a system are 
needed. The advent of distributed processing, then, 
whether for device control or distributed application pro- 
cessing, was the fundamental technical rationale behind 
the creation of SNA. 
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From an architectural point of view. SNA is a top- 
down structured design composed of layers [12, 13], The 
lowest layer, data link control, directly manages physical 
resources— the transmission facilities that connect nodes. 
Successive layers provide additional services. For ex- 
ample. the path control layer provides a routing service, 
so that its users are unaware of the physical topology of 
the network, and some nodes contain a control point that 
controls the nodes (e.g., terminals and controllers) and 
lines in their own portions of the network. Other layers 
provide services to applications: these can include trans- 
parent access to local or remote resources (e.g., dl/i 
VSAM files), mapping of data streams to and from appli- 
cation data structures (also called presentation services), 
access to other local or remote programs, management of 
bufter commitments, and encryption of data before trans- 
mission and decryption upon receipt. 

This layered view of a communications architecture 
was used in a network sponsored by the U.S. Advanced 
Research Projects Agency (ARPANET) in the late 1950s. 
SNA was the first commercial implementation of the lay- 
ered concept and has been followed by most other com- 
puter manufacturers who offer teleprocessing software 
support and by the International Standards Organization 
(ISO) in its recommendations for system interconnection 
[Id. 15], 


As was later done in public packet-switching networks. 
SNA adopted the technique of sharing the use of links by 
time-division multiplexing on a message basis. Each mes- 
sage is switched at intermediate nodes to the next link 
along its route. 


Networking 

The term networking is used to connote the concept of a 
geographical distribution of terminals (usually hundreds 
or thousands in ten to one hundred locations) working 
with application programs in computer complexes (usu- 
ally one to ten or more computers in a like number of 
locations). The pioneering networks, such as for airlines 
and banks, were devoted to a single major application, 
such as reservations or savings deposit accounting, and 
constrained the variety of attached terminals usually to 
one type of display and one type of printer. Such restric- 
tions have been largely eliminated in most networks in 
operation and being planned today. Now many appli- 
cation types, each with varying load and performance re- 
quirements, can operate concurrently; many varieties of 
displays, printers, and industry-oriented terminals can 
coexist in the same network; and each terminal no longer 
need be dedicated to a single use. Rather, it can be shared 
among applications varying from simple inquire-and-up- 
date to text processing and problem solving. 


To control the single-application networks, the tele- 
communications access methods were tightly coupled 
with the application and. when tuned for performance and 
reliability, reasonable tradeoffs could be made between 
the technical requirements and the applications. These 
tradeoffs often took the form of unique link controls, spe- 
cialized front-end processors, and extensive modifica- 
tions of the telecommunications access methods. Many of 
these, such as the examples already mentioned, in- 
troduced new concepts, but for single rather than for mul- 
tiple applications. 


As the need for more general-purpose networking capa- 
bility grew, so also did the need for codification of con- 
ceptual design so that hardware and software products 
would work in harmony, and so that each installation 
could be readily tuned for performance and reliability. 
The SNA plan, implemented in such System/370 software 
products as VTAM, TCAM, and NCP, now provides ex- 
tensive leading-edge networking capability [16-18], 


• Networking services 


The initial releases of SNA (through 1977) provided user 
services for single-host tree configurations. A primary 
service was the sharing of network resources; for ex- 
ample. multiple terminals of varying types on a single 
SDLC link could interact with any of multiple appli- 
cations in one host. This sharing is key to reducing net- 
work costs and providing flexible services. 




The major objectives in the development of SDLC with 
respect to previous link controls included bit trans- 


parency. full-duplex transmission, improved error check- 
ing. capability of operation over long-delay high-speed fa- 
cilities, and reduced protocol overhead [19-21], IBM de- 
veloped SDLC in parallel with international standards 
activities that led to the formulation of the ISO HDLC 
(High-level Data Link Control) standard. The use of a 
pause-and-retry error recovery algorithm in NCP also im- 
proved availability by allowing links to automatically re- 
cover from transient link problems without requiring op- 
erator intervention. Configuration-management services 
were provided by VTAM and TCAM, through which the 
network operator could activate and deactivate links and 
nodes, and receive notification of failures. Other services 
provided the connection between applications in the Sys- 
tem/370 and those in the terminals or controllers. 


a suosequent release of SNA in 1978 introduced mul- 
tiple-host networks [22], This included capabilities in 
which a terminal controlled by one host could access an 
application in any host in the network, and host-to-host 
sessions could also be established. The single control 
point (for session establishment and configuration ser- 
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vices) and hierarchical control were generalized to a net- 
work of multiple control points which operate as peers of 
one another. Further enhancements provided functions 
such as parallel links, transmission priority, and multiple 
active routes for data transmission. Parallel links may be 
used between adjacent nodes of a network to provide ad- 
ditional bandwidth and backup, and these parallel links 
may be logically grouped to automatically distribute traf- 
fic across the links of a group. This concept also com- 
pensates for degradation resulting from errors on any of 
the links in the group, because transmission is disrupted 
only if the last remaining link in the group fails. Network 
availability can also be increased by providing multiple 
routes between the same two points in a network, so that 
traffic can be rerouted (and disrupted sessions recon- 
nected) to avoid failing intermediate nodes or failing 
links. Multiple routes can also be useful for traffic load 
leveling. These capabilities gave SNA the complete con- 
figuration flexibility of mesh networks, as distinct from 
the former tree structures and connection of trees. 


To save costs, networks are normally designed so that 
the peak rate of traffic into the network may, at times, 
exceed the maximum network throughput. Queues in the 
network help smooth the peak loads, but flow-control 
mechanisms are necessary to prevent substantial 
throughput degradation, or even deadlocks, as the offered 
load increases beyond the network’s capacity [23, 24], 
Flow control operates by limiting the rate at which traffic 
is accepted into the network. SNA products use a flow- 
control mechanism based on pacing which allows a spe- 
cific number of message units to be sent from one end of a 
route, after a pacing response is received from the other 
end. This number is dynamically adjusted by checking 
queue depths at the nodes along the route. The dynami- 
cally adjusted pacing values provide greater throughput 
than statically defined values used in other systems. An- 
other aspect of SNA flow control is the use of message 
priorities, such that at each trunk line messages are trans- 
mitted in the order of the priority given to their respective 
sessions. Performance analysis has shown that in net- 
works experiencing high loads, predefined routing with 
pacing also may provide better performance than routing 
schemes that allow routing decisions on every message 
without establishing explicit physical routes for session 
traffic. 


The origins of many of these networking services may 
be found in older network designs, both research and 
commercial, but IBM improved many of the algorithms 
and was the first to offer such extensive total function to 
its customers. 

• Special services 

In addition to the fundamental network control and net- 


work services offered, many other essentia] and in- 
novative features are included; three of these are network 
testing and diagnosis, cryptography, and distributed data 
base sharing. 

In 1979, an innovative series of modems was in- 
troduced. Not only are the traditional modem functions 
enhanced (for example, the IBM 3865 modem is the first 
to operate at 9600 bits per second on unconditioned lines 
and has an equalization time on multipoint lines about 
five times faster than other contemporary modems) but, 
in a unique way, modem testing and diagnostic capabili- 
ties were fully integrated into the network problem-deter- 
mination facilities [25-28]. The tests intersperse diagnos- 
tic sequences with user data without disrupting ongoing 
sessions. Information provided by these modems to NCP 
and other programs can help to pinpoint errors in data 
terminal or communications facilities, or to identify unfa- 
vorable line quality trends. An operator using these facili- 
ties can initiate tests of modem pairs and the communica- 
tions line between them. 


In the early 1970s, scientists and engineers who were 
engaged in the study of solutions to security problems as- 
sociated with data processing concluded that crypto- 
graphic techniques were required to secure information 
communicated through physically unprotected networks. 
In 1972, IBM researchers completed their work in the de- 
velopment of powerful cryptographic algorithms which 
were simple enough to be implemented on a single chip 
and yet secure enough for commercial applications [29- 
32]. This effort led to the development of the Data En- 
cryption Standard Algorithm, which in 1977 was adopted 
by the National Bureau of Standards as the U.S. Federal 
Standard for Cryptography for data processing appli- 
cations [33]. 


The system design aspects of cryptography proved to 
be particularly challenging. "End-point-to-end-point” en- 
cryption [30] is;necessary; that is, in most instances en- 
cryption/decryption operations should be performed by 
the sender and receiver of the information; intermediate 
nodes, through which the information may flow, should 
not have access to non-encrypted data. The SNA archi- 
tecture proved particularly helpful in meeting these objec- 
tives, since its protocols and formats provide easily parti- 
tioned headers (which are not encrypted), and data fields 
(which are encrypted) that are unconstrained from any 
coding considerations. Other system design objectives in- 
cluded the following: Each data set or record may be en- 
crypted under a different key and each node may have a 
different cryptographic key; the cryptographic key must 
not be subject to manipulation by the control program, 
even if the control program is modified by unauthorized 401 
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personnel: and through the use of authorized hierarchical 
keys, files encrypted by one system can be read (and writ- 
ten) by any other authorized system without the necessity 
of sharing knowledge of the keys used for enciphering the 
data. In order to accomplish these objectives, a new hier- 
archical key-management scheme was devised that per- 
mits the use of system facilities without compromising the 
security of the encrypted system-key tables [30], 

As network components become more intelligent and 
sophisticated, distributed processing and distributed data 
bases will become increasingly commonplace. Processors 
introduced in the late 1970s such as the 4300. 8100, Series/ 
1. System/34, and System/38 emphasize this trend. Appli- 
cations are being developed that are comprised of pro- 
grams residing in several processors in several locations. 
These geographically dispersed programs, in turn, use 
data files distributed in several locations, often different 
from the locations of the programs. An example of such a 
system might be inventory control in which parts-avail- 
ability information is located at the production site, parts- 
on-order information is located at various receiving 
docks, vendor-billing data are processed at one central 
computer complex, and vendor-quality-control statistics 
are analyzed at another. To facilitate the coordination, 
error recovery, and efficient operation of such systems. 
SNA products include conventions for program-to-pro- 
gram operation in addition to terminal-to-program opera- 
tion. The intersystem communications capability [13, 34] 
of Information Management System (IMS), Customer In- 
formation Control System (CICS), and Distributed Pro- 
cessing Programming Executive (DPPX), announced in 
1979. provides the capability of each of these products to 
communicate with one another, to route messages based 
on content, and to support distributed data bases with the 
actual geographic location of the data masked from the 
using application programs. 

Infernal IBM networks 

The I B M Corporation is not only a producer of data com- 
munications programs and products, but is also a major 
user of these products in support of its worldwide busi- 
ness operations. In the past ten years, internal use of data 
communications has grown to the point where develop- 
ment, manufacturing, marketing, field engineering ser- 
vice, and headquarters operations cannot function 
smoothly unless the internal networks and associated ap- 
plications are available to IBM personnel worldwide. In 
the 1960s, most operating units met their needs with dedi- 
cated networks. In the 1970s many of these dedicated net- 
works were merged for reasons of economy and effi- 
ciency. 

Five very large international networks emerged from 
this consolidation. It is instructive to look at just one of 


these, the Corporate Consolidated Data Network 
(CCDN), as it is a good example of usage and growth typi- 
cal in large industrial corporations. The other networks 
[35] are similar but are operated independently of CCDN 
because of their size (e.g . , the System Communications 
Division’s network has 8000 terminals), or their unique 
geographic orientation (e.g . , for Europe, the Middle East 
and Asia) or their operating characteristics (e.g., inter- 
national bulk transmission). 


The CCDN network is a centrally managed, high-per- 
formance interactive network that supports both SNA 
and older applications. All 15 000 terminals are SNA ter- 
minals. The System/370 concentrators are located in four 
cities across the United States. They are unattended and 
are remotely managed by the System/370 Network Con- 
trol System in New York. The eighteen host application 
sites are geographically dispersed throughout the United 
States. 

The growth of CCDN usage, which has been approxi- 
mately 25% per year, can be appreciated by reviewing a 
few statistics. Let us compare the years 1975, 1978, and 
1981 for several important parameters: The number of 
terminals has increased from 4400 to 8100 to 15 400; ap- 
plication programs, from 3 to 14 to 30; and characters 
transmitted per year, from 75 to 204 to 460 billion. The 
applications available to the 40 000 CCDN users include ' 
order entry, financial analysis, proposal preparation, 
message switching, education and training, and history of 
repair actions. 

Satellite transmission 

In 1965. the Intelsat consortium successfully launched 
"Early Bird." the first geostationary communications 
satellite for telephone and television transoceanic ser- 
vice. Shortly thereafter, it was recognized that communi- 
cations satellites were significant for future data process- 
ing systems, and IBM initiated studies that led to a time- 
division multiple-access (TDMA) system to efficiently 
serve both voice and data systems. By 1973, a precursor 
IBM system known as Telesatcom, using the Canadian 
Anik Satellite, had implemented voice, data, and image 
communications using TDMA. In another test, earth sta- 
tions in Poughkeepsie, New York, and Rochester, Minne- 
sota, efficiently communicated at 1.544 Mbps. 


More extensive testing in 1977 and 1978 between sites 
in the United States, France, and Germany using the Eu- 
ropean Symphonie satellite, also at 1.544 Mbps, demon- 
strated the feasibility of distributed data processing func- 
tions such as remote job entry, tape-to-tape and disk-to- 
disk file transfer, and dynamic-network load sharing [36]. 
The success of the Telesatcom precursor led to the instal- 
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lation of an operational system in the United States, be- equipment. The first system was installed at South- 
ginning in 1978, providing IBM employees in 29 cities western Bell in St. Louis, Missouri in January 1965. 
with telephone communications via satellite. The Tele- 
satcom experience has influenced the development of a # Stored-program-controlled PABX 

highly sophisticated satellite communications controller Telephone exchanges up to the early 1960s were almost 

for Satellite Business Systems, which efficiently com- solely electromechanical. Since then semi- or fully elec- 

presses and merges voice and data streams and is the pri- tronic exchan S es have been developed and the control 

mary system component co-located with earth stations portion of these switching systems uses stored-program 

residing directly on customer premises. computers. 

IBM began a development effort on a stored-program 

Special data communications applications . . . . . . . . . . . 

r private automatic branch exchange in the mid-1960s. In 

Finally, it might be instructive to take a look at the three w Tnw ... , ^ 

3 . . May 1969 it announced the IBM 2750 Voice and Data 

special communications applications outside of the main e , .. c 

y r Switching System for marketing in five European coun- 
stream of data communications development. • • ir * e TDU 

K tries. The principal features of the IBM 2750 were 


9 Audio response 

Work in audio response can be traced back to early devel- 
opment in Endicott in 1954. A device called the “VOX- 
BOX” had a vocabulary of twelve words on a magnetic 
drum. The device could answer a telephone call and then 
respond with audio messages based on the digits dialed. 

In 1961 , a project known as Voice Answer Back (VAB) 
was initiated in Poughkeepsie. The VAB unit stored 
words or phases on tracks of a magnetic drum. Tech- 
niques were developed for digitizing the voice recording 
and expanding it or compressing it as required so that it 
occupied a full revolution of the drum when converted 
back to analog form. IBM’s first product, the IBM 7770 
Audio Response Unit, was announced in January 1964. A 
precursor unit was installed in the New York Stock Ex- 
change quotation system. IBM was the first company to 
introduce a commercially available standard audio re- 
sponse product. 

The significance of this innovation was that it 

• Added a new form of terminal output— the spoken 
word. 

• Made the standard ®Touch-Tone [2] telephone a termi- 
nal for inquiry-response applications. 

• Opened up a number of significant new applications, 
heretofore not economically feasible, such as credit 
verification, bank-balance inquiry, and telephone inter- 
cept. 

• Telephone intercept 

In 1963, work began on the development of a system for 
telephone companies that could automatically intercept 
telephone calls to non-working numbers. By looking up 
information stored in a computer data base, the IBM 7770 
Audio Response Unit could then transmit a message such 
as “the number you have called has been changed to xxx- 
xxxx.” The 2910 Automatic Intercept System [37] was 
connected directly to telephone central-office switching 


• A space-division switch using solid state crosspoints for 
the first time; 

• Stored-program control, supporting both voice and data 
applications; 

• A BSC adapter permitting communications with a Sys- 
tem/360; 

• High reliability and availability through redundant de- 
sign; 

• Attachment of a wide variety of terminal devices in ad- 
dition to telephone handsets. 

In 1972 a follow-on system, the IBM 3750, was an- 
nounced. The 3750 can handle up to 2500 lines, has net- 
working capabilities, has greater traffic-handling capacity 
than the 2750, and better reliability and diagnostic charac- 
teristics. The 1750, announced in 1979, handles from 100 
to 760 lines. 

Conclusion 

The last twenty-five years have seen significant changes 
in data communications. At first, it seemed that only un- 
usual applications required data to be moved rapidly from 
one place to another. Today, we have airline reservations 
systems, banking systems, and data communications net- 
works with terminals in stores, offices, and factories. In 
the U.S. work force in 1980, there is a terminal for every 
48 employees. Among IBM customers there are 25 em- 
ployees per terminal. And within IBM itself we have 
about five employees per terminal. By the mid-1980s, it is 
estimated there will be a terminal for every ten employees 
in the U.S. work force: one for every six of our custom- 
ers’ employees, and one for every two IBM employees 
[38], 

The changes in technology have been as spectacular as 
the growth in numbers. Now we have interactive as well 
as batch processing; shared and dedicated lines; and high- 
speed, synchronous as well as slow-speed, start-stop data 
transmission. Many uses remain for keyboard input and 
printer output, but laser scanners and visual displays, and 403 
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even audio response, make it easier for people to use data 
communications systems. Hardwired, inflexible terminals 
are being replaced by distributed processing with pro- 
grammable terminals. 

In the remaining two decades of the century, conve- 
nient terminals equipped with display screens— not re- 
quiring special skills for operation— should move into 
many offices, schools, and then into homes. Distributed 
data bases will allow people ready access to information 
and special services of many types. Data, voice, and fac- 
simile services will probably become available from a 
single all-digital system. 
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Real-Time Systems for Federal Applications: A Review 
of Significant Technological Developments 


The Federal Systems Division of IBM has been heavily involved in complex, on-line, real-time systems for over twenty- 
five years. In this paper a representative sample of these programs are reviewed, and an evaluation of the significant 
lessons learned from this wealth of experience is presented. The key issues which differentiate real-time systems from 
their more conventional data processing counterparts are identified and their implications are discussed. This leads to 
some conclusions regarding the kind of commitment that is necessary in order to succeed in the area of real-time appli- 


Introduction 

The basic mission of the Federal Systems Division of 
IBM (FSD) is to serve the special data and signal process- 
ing needs of the federal government. Some of these needs 
can be satisfied with standard products, but other require- 
ments can only be met with a combination of hardware, 
software, and services configured to satisfy the user’s 
applications. 

Fulfillment of FSD’s mission has involved a broad 
range of challenging applications and technology, yet 
there is a common thread among these widely varied pro- 
grams: FSD has consistently been involved in on-line, 
real-time applications. Real-time data processing is char- 
acterized by highly structured and repetitive tasks, high 
speed, stability considerations for closed-loop process- 
ing, automated fault detection and decision making, and 
customized input and output. The following sections of 
this paper describe programs that trace the evolutionary 
progress of FSD in real-time systems. An overview of 
representative FSD systems is given and the FSD design 
environment is addressed. This is followed by sections on 
technological implications, real-time digital processing 
technology, reliability and availability, and total responsi- 
bility. The last section of the paper describes the LAMPS _ 


(Light Airborne Multi-Purpose System) program to illus- 
trate the nature of one aspect of the division’s current 
activity. 

Twenty-five years of challenge: An overview of 
representative FSD systems 

FSD was launched in 1955 as the Military Products Divi- 
sion of IBM. At that time it had two major contracts: the 
data processing centers for the SAGE (Semi-Automatic 
Ground Environment) air defense system, and the on- 
board AN/ASQ-38 navigation and bombing system for the 
B-52 aircraft. These programs involved real-time systems 
that were significant technological advances over any- 
thing preceding them, and they were representative of the 
great majority of FSD programs ever since. For example, 
SAGE was the first, of many complex ground-based com- 
mand and control systems for FSD. Subsequent FSD pro- 
grams in this category include 

• SABRE (Semi-Automatic Business-Related Environ- 
ment)— American Airlines’ on-line interactive reserva- 
tions system [1]. 

• IBM 9020— the multiprocessing computer system that 
is the core of the FAA’s National Airspace Systemfor 


Copyright 1981 by International Business Machines Corporation. Copying is permitted without payment of royalty provided that (1) 
each reproduction is done without alteration and (2) the Journal reference and IBM copyright notice are included on the first page. 
The title and abstract may be used without further permission in computer-based and other information-service systems. Permission 
to republish other excerpts should be obtained from the Editor. 


IBM j. RES. DEVELOP. • VOL. 25 • NO. 5 • SEPTEMBER 1981 


P. F. OLSEN AND R. J. ORRANGE 


